(57) In a video-coding and video-decoding device, motion-compensated interframe prediction is achieved by
determining motion vectors of respective pixels from weighted representative motion vectors, determining
accurate motion vectors between video frames, dividing each frame into areas optimal to the shape, size and
location of an object, and effectively encoding and decoding the motion vectors. According to the invention,
a motion-compensated interframe predicting portion (217, 1217) generates a predicted video-frame by varying
the codable area according to a reference video-frame received from a frame memory portion (216) and an input
video frame, and obtains side-information. A motion vector searching portion (232) searches for a motion
vector. An effective-area selecting portion (231) selects a valid or invalid mask depending upon the position
of a processable object and divides a processable area of the input video frame into suitable areas. A
variable-area predicted-frame generating portion (233) generates a predicted frame by affine transformation
and translational displacement. An area-dividing pattern deciding portion (234) outputs side-information such
as the predicted image, motion vectors and divided areas. A side-information coding portion (235) encodes an
additional motion vector as a difference from an average basic-motion vector, or predictively encodes a motion
vector from a median value of three neighboring motion vectors. A difference between the input video-frame and
the predicted video-frame from the predicting portion (217, 1217) is encoded, transferred and stored.
Description



BACKGROUND OF THE INVENTION 

The recent progress of high-speed digital signal processing systems, together with advances in LSI technology and the
development of video processing techniques, has met an increasing interest in more effective usage of video information.
In the field of telecommunications, the construction and development of digital networks represented by ISDN
(Integrated Services Digital Network) have realized video communication services such as videophones, video
conferencing services, video database services and so on. Furthermore, the spread of mobile communications networks
with advances in digital information technology has increased users' need for mobile video communications
services.

It is impractical to handle video information as it is, because video information generally contains a very large amount
of information. However, it is possible to reduce the amount of video information by reducing its redundancy.
Highly efficient video compression techniques are particularly important to mobile communications networks,
which have a small transmission line capacity. For this reason, the international organizations ITU-T and
ISO/IEC have worked energetically to establish international standards on video coding methods for encoding video
signals at a very low bit-rate.

A video signal contains time-sequential information, such as a change of motion in a picture, and spatial information
concerning the content of one frame (a video frame or a video field; both will hereinafter be called a frame), each of
which has redundancy. A motion compensative interframe prediction with orthogonal transform encoding method, which has
been preferably used, first reduces the temporal redundancy of information through interframe prediction with motion
compensation. According to the principle of the motion compensative interframe prediction encoding method, the motion
compensative interframe predicting portion prepares a predicted value of an input video signal from an already encoded
signal stored in a frame memory and outputs the difference between the predicted value and the input video signal as a
prediction error signal. A prediction error signal encoding portion encodes the prediction error signal by an
orthogonal transform method to further reduce the spatial redundancy of the information. The encoded prediction error
signal is locally decoded, then stored in the frame memory and used for prediction of a succeeding frame.

In very low bit-rate video encoding, it is necessary to represent a video signal with a very small amount of
information. The amount of information allocated to the orthogonal transform encoding (i.e., prediction error encoding)
is therefore considerably limited. Accordingly, it is very important to improve the efficiency of the interframe
prediction by using an interframe prediction method that can more correctly predict a change of a video signal with time.

In recent years, many studies have been made on interframe prediction methods using affine transformation
and bilinear transformation. While the motion compensative interframe prediction method represents a movement
contained in a video as a translational motion by using a motion vector per unit-area, the method using the affine
transformation and bilinear transformation can represent a movement in a video more accurately, i.e., with increased
prediction efficiency, since it can represent movement, rotation, enlargement and deformation.

An interframe predicting portion of the conventional video-coding device comprises a frame memory portion for
storing already coded video signals, a motion vector detecting portion for determining a representative motion vector
per unit area from a difference between an input video signal and a video signal read from the frame memory portion,
a motion vector interpolating portion for determining a motion vector per pixel from the representative motion vector,
and a pixel value predicting portion for preparing a predicted video signal from a video signal read from the frame
memory portion by using the motion vector per pixel.

The operation of the above-mentioned interframe predicting portion is as follows: 

The frame memory portion stores already coded video signals as reference video frames for interframe prediction. 

The motion vector detecting portion receives an input video-frame signal to be encoded and reads a reference video
frame stored in the frame memory. The motion vector detecting portion divides the coding video frame into unit-areas
and scans the reference video frame to find the area most similar to the current coding unit-area. The displacement of
the unit-area of the coding video-frame from the area found in the reference video frame is outputted as a motion vector.
The motion vector is a representative motion vector representing an interframe displacement of a representative point
within a unit-area (usually, the center of the unit-area). The relationship between a representative point and a
vector-searched unit-area is specified. In searching for a similar area in a reference video frame, a sum of absolute
differences or a sum of squared differences over the pixels in a unit-area is used as a measure of similarity.
Furthermore, the center portion of the unit-area may be weighted by multiplying a differential pixel value at the center
portion of the unit-area by a large coefficient and a differential pixel value at the periphery thereof by a small
coefficient prior to summation, so as to more accurately determine the displacement of the representative point.
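
The center-weighted search described above can be sketched as follows. This is only an illustrative sketch, not the device's actual circuit: the function names, the exhaustive search window and the convention that a vector (dx, dy) points from a pixel of the current frame to its match in the reference frame are all assumptions made for the example.

```python
def weighted_sad(cur, ref, cx, cy, dx, dy, weights):
    """Weighted sum of absolute differences for one candidate displacement.

    cur, ref : frames as lists of rows of luminance values
    (cx, cy) : representative point (center of the unit-area) in cur
    (dx, dy) : candidate displacement into ref (assumed sign convention)
    weights  : square table of coefficients, largest at the center
    """
    half = len(weights) // 2
    total = 0
    for j in range(-half, half + 1):
        for i in range(-half, half + 1):
            diff = cur[cy + j][cx + i] - ref[cy + j + dy][cx + i + dx]
            total += weights[j + half][i + half] * abs(diff)
    return total


def search_motion_vector(cur, ref, cx, cy, weights, search=2):
    """Exhaustive search for the displacement with the smallest weighted SAD."""
    half = len(weights) // 2
    height, width = len(ref), len(ref[0])
    best, best_cost = (0, 0), None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            # Skip candidates whose block would leave the reference frame.
            if not (half <= cx + dx < width - half and
                    half <= cy + dy < height - half):
                continue
            cost = weighted_sad(cur, ref, cx, cy, dx, dy, weights)
            if best_cost is None or cost < best_cost:
                best, best_cost = (dx, dy), cost
    return best, best_cost
```

Replacing `abs(diff)` with `diff * diff` gives the squared-difference measure also mentioned in the text.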

The representative motion vector is transferred to the motion vector interpolating portion, which in turn determines
motion vectors for respective pixels using the received representative motion vectors. For affine transformation, a
motion vector for each pixel existing within a triangular area surrounded by three neighboring representative points
(hereinafter called a transformable unit-area) is determined by solving an affine transformation expression from the
representative motion vectors of the respective representative points. For bilinear transformation, a motion vector for
each pixel existing within a quadrangular area surrounded by four neighboring representative points (hereinafter also
called a transformable unit-area) is determined by solving a bilinear transformation expression from the representative
motion vectors of the respective representative points. For a transformable unit-area that is quadrangular or
rectangular, this is equivalent to distributing the motion vector value of each representative point in the vertical
and horizontal directions in proportion to the distance between a remarked pixel and a representative point.
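
For a rectangular unit-area, the distribution of the four representative vectors over the pixels can be sketched as a plain bilinear interpolation. The function name and the normalized coordinates (u, v) are assumptions introduced for this illustration only.

```python
def interpolate_motion_vector(v00, v10, v01, v11, u, v):
    """Bilinearly interpolate a pixel motion vector inside a rectangular unit-area.

    v00, v10, v01, v11 : representative vectors at the top-left, top-right,
                         bottom-left and bottom-right representative points
    u, v               : normalized horizontal and vertical position of the
                         remarked pixel, each in [0, 1]
    """
    # Interpolate horizontally along the top and bottom edges ...
    top = tuple((1 - u) * a + u * b for a, b in zip(v00, v10))
    bottom = tuple((1 - u) * a + u * b for a, b in zip(v01, v11))
    # ... then vertically between the two edge results.
    return tuple((1 - v) * t + v * b for t, b in zip(top, bottom))
```

A pixel lying on a representative point receives exactly that point's vector, and a pixel in between receives a distance-weighted blend of the four.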

The pixel value predicting portion receives the motion vectors of the remarked pixels, inputted pixel by pixel, and,
considering each motion vector to be a displacement of the remarked pixel from its corresponding position in the
reference frame, reads the pixel value at the corresponding position from the frame memory as a predicted value of the
remarked pixel to compose a predicted frame. If the pixel motion vector indicates a position in the reference frame
where no pixel exists, e.g., when the pixel motion vector (displacement) has a fractional part, neighboring pixel values
in the reference frame are read out from the frame memory and the predicted value of the remarked pixel is determined as
an interpolated value according to a known bilinear interpolation method.
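
A minimal sketch of this pixel value prediction is given below. Clamping positions to the frame boundary is an assumption of the example (the text does not say how out-of-frame positions are handled), and the function names are hypothetical.

```python
import math


def sample_bilinear(frame, x, y):
    """Read a value from `frame` at a possibly fractional position (x, y)."""
    height, width = len(frame), len(frame[0])
    # Assumption: positions outside the frame are clamped to its border.
    x = min(max(x, 0.0), width - 1.0)
    y = min(max(y, 0.0), height - 1.0)
    x0, y0 = int(math.floor(x)), int(math.floor(y))
    x1, y1 = min(x0 + 1, width - 1), min(y0 + 1, height - 1)
    fx, fy = x - x0, y - y0
    top = (1 - fx) * frame[y0][x0] + fx * frame[y0][x1]
    bottom = (1 - fx) * frame[y1][x0] + fx * frame[y1][x1]
    return (1 - fy) * top + fy * bottom


def predict_pixel(ref, x, y, mv):
    """Predicted value of the pixel at (x, y) displaced by motion vector mv."""
    return sample_bilinear(ref, x + mv[0], y + mv[1])
```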
In a simplified structure using the affine and bilinear transformation methods, the interframe predicting portion of the
conventional video-decoding device comprises a frame memory portion for storing already decoded video signals, a
motion vector interpolating portion for determining a motion vector of each pixel from a representative motion vector
inputted for each unit-area, and a pixel value predicting portion for preparing a predicted video signal from a video
signal read from the frame memory by using the pixel motion vectors.

The conventional video coding and decoding devices may show excellent performance if an entire area surrounded
by representative points can be represented by the same parameters of affine transformation and bilinear
transformation. However, when the positions of the representative points do not match the changes between
video frames, e.g., the positions and movement of an object, the representative motion vectors of the representative
points represent different movements of different objects in the frame, with the result that a motion vector of a
pixel, obtained from the representative motion vectors, cannot represent an adequate displacement of the pixel.
Consequently, the conventional device in this case may suffer considerably decreased efficiency of interframe
prediction, i.e., coding efficiency.

Video communication services, e.g., videophones and video conferencing have been realized over high-speed 
digital communication networks such as ISDN (Integrated Services Digital Network).

Recently, with the spread of radio transmission networks represented by PHS (Personal Handyphone System),
progress of data modulation/demodulation techniques for PSTN (Public Switched Telephone Network) and advances
in image compression techniques, there have been increasing demands for video communication services over lower
bit-rate transmission lines.

Video communication services such as video telephones and video conferencing systems must transmit a very large
amount of information over limited transmission lines. Accordingly, it is necessary to reduce the amount of information
by compressing and encoding the video information in view of the transmission speed and costs of usable transmission
lines.

As is well known, H.261, MPEG-1 (Moving Picture Experts Group) and MPEG-2 are internationally established
standard coding methods for compressing video information. In addition, MPEG-4 is now being standardized
as a method for coding at a very low bit-rate of not more than 64 kbps.

The standardized video-coding methods adopt a hybrid video coding method performing interframe prediction
coding in combination with intraframe prediction coding.

The interframe prediction coding generates a predictive video-frame from a reference video-frame and encodes
the difference of the predictive frame from a current video-frame to reduce the amount of codes to be transmitted. This
enables effective use of transmission lines.

Known interframe predictive coding methods include a block-matching method, an affine transformation method and
a warping prediction method. A conventional video-coding and video-decoding device using the affine transformation
will be described as follows:

The operation of the conventional video-coding device is first explained below. 

It is assumed that a reference video frame usable for producing a predictive video-frame is stored in a frame 
memory portion in a normal operating condition of the conventional video-coding device for conducting the motion- 
compensated interframe predictive coding. 

An input video frame enters into a subtracting portion and a motion-compensated interframe-predicting portion.
The motion-compensated interframe-predicting portion determines a predictive motion from the reference video-frame
stored in the frame memory portion and the input video-frame and outputs a predicted video-frame to the subtracting
portion.

The subtracting portion subtracts from the input video-frame the predicted video-frame entered by the
motion-compensated interframe predicting portion and outputs the subtraction result, i.e., predicted error information,
to a video coding portion.

The video coding portion spatially transforms the input predicted-error-information, e.g., by using DCT (Discrete
Cosine Transform), quantizes the transformed information and outputs coded video-information. At the same time, the
coded video-information outputted from the video coding portion is locally decoded by a video-decoding portion and
is then transferred to the adder portion.

The adder portion adds the decoded predicted-error-information outputted from the video-decoding portion to the
predicted video-frame outputted from the motion-compensated interframe-predicting portion to form a new reference
video-frame, which is then transferred to the frame memory portion.

The frame memory portion stores therein the new reference video-frame outputted from the adder portion. This
new reference video-frame is given to the motion-compensated interframe predicting portion when encoding a
succeeding input video-frame.
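
The coding loop just described can be sketched on one-dimensional "frames". This is a deliberately reduced illustration: a uniform scalar quantizer stands in for the DCT and quantization stage, and the function names and the `step` parameter are assumptions of the example.

```python
def encode_frame(frame, predicted, step):
    """One pass of the hybrid coding loop on a frame given as a flat list."""
    # Subtracting portion: prediction error between input and predicted frame.
    residual = [x - p for x, p in zip(frame, predicted)]
    # Video coding portion (stand-in): quantize the prediction error.
    coded = [round(r / step) for r in residual]
    # Local decoding and adder portion: rebuild the new reference frame.
    decoded_residual = [q * step for q in coded]
    reference = [p + r for p, r in zip(predicted, decoded_residual)]
    return coded, reference


def decode_frame(coded, predicted, step):
    """Decoder side: dequantize the error and add it to the same prediction."""
    return [p + q * step for p, q in zip(predicted, coded)]
```

Because the encoder rebuilds its reference frame from the quantized error rather than from the original input, the reference frame it stores is identical to the frame the decoder reconstructs, so both sides predict the next frame from the same data.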

The video coding device outputs a series of coded video information (predicted error information) and a series of
coded side-information by repeating the above-mentioned sequence of operations.

The operation of the conventional video-decoding device is now described below.

It is assumed that a reference video-frame usable for producing a predictive video-frame is stored in a frame
memory portion in a normal operating condition of the conventional video-coding device for conducting the
motion-compensated interframe predictive coding. Coded video-information inputted into the video-decoding device enters
into a video-decoding portion, which decodes the coded video-information by performing the same operations as the
video-decoding portion of the video-coding device and outputs the obtained differential video-frame to an adder portion.

Coded side-information inputted into the video-decoding device enters into a motion-compensated interframe
predicting portion.

The motion-compensated interframe predicting portion decodes the received coded side-information and obtains 
motion vectors, then it produces a predicted video-frame by using the obtained motion vectors and a reference video- 
frame read from the frame memory portion and transfers the produced predicted video-frame to an adder portion. 

The adder portion obtains an output video-frame by adding the differential video-frame from the video-decoding
portion to the predicted video-frame from the motion-compensated interframe predicting portion. The output
video-frame is outputted from the video-decoding device and at the same time is transferred to the frame memory.

The frame memory stores therein the video-frame inputted from the adder portion as a new reference video-frame
that will be used by the motion-compensated interframe predicting portion when decoding a succeeding video-frame.

The video decoding device outputs a series of decoded video-frames by repeating the above-mentioned
sequential operations.

The operation of a motion-compensated interframe predicting portion in a video-coding device and a video-decod- 
ing device and prior arts applied therein will be described below. 

An exemplified structure of a motion-compensated interframe predicting portion and its operation will be first
described as follows:

The motion-compensated interframe-predicting portion comprises a motion-vector searching portion, a fixed-area-size
predicted-frame generating portion and a motion-vector coding portion.

The motion-vector searching portion searches motion vectors in an input video-frame and a reference video-frame
read from a frame memory and outputs the detected motion-vectors to the fixed-area-size predicted-frame generating
portion.

When searching motion vectors, the motion-vector searching portion searches a motion vector of a control grid 
point by previously weighting a center pixel of a processable area and outputs a motion vector of each control grid 
point to the fixed-area-size predicted-frame generating portion. 

According to the conventional method, a processable area has a fixed size, with control grid points set one for
every 16 or 8 pixels.

The fixed-area-size predicted-frame generating portion performs interframe prediction on each processable area 
having a fixed size (generally in a rectangular block of 16 x 16 pixels) by using motion vectors inputted from the motion 
vector searching portion and a preceding video-frame read from the frame memory portion. 

The interframe prediction is realized by determining affine parameters from positions of three vertex points of each
objective triangular area, performing affine transformation on every pixel existing within the triangular area and
repeating the same processing on all processable triangular areas to produce a predicted video-frame.

The produced predicted video-frame is transferred to the before-described subtracting portion and adder portion. 
The motion vectors are transferred to the motion vector coding portion. 

The motion vector coding portion encodes the motion vectors received from the fixed-area-size predicted-frame
generating portion and outputs them as coded side-information.

The motion-vector coding portion encodes the motion vectors by the following method:

Motion vectors are generally encoded not directly but by a predictive method. Namely, a predicted value of each
motion vector to be coded is determined and the difference between the predicted value and the motion vector
value is encoded. By doing so, the efficiency of coding can be improved with a reduced amount of information.

The simplest method of determining a predicted value is to adopt the vector value just coded as the
predicted value of the succeeding motion vector. According to this method, differences between two successive
neighboring motion vectors are encoded in turn. Because of the high correlation between motion vectors, it is possible
to effectively encode motion vectors, particularly when neighboring motion vectors have similar values. This prediction
coding method is adopted in the motion-vector coding method defined by ITU-T Recommendation H.261.

Another method is to determine a predicted value from a plurality of motion vectors including the just-preceding
vector. In this case, a prediction value is determined by using the values of three motion vectors which are located,
respectively, left, just above and above right of the motion vector to be coded. The prediction value may be a mean
value or a median of the values of the three neighboring motion vectors. In comparison with the predicting method
using only the just-preceding motion vector, this method can use a wider range of correlation, i.e., a higher correlation
of motion vectors, attaining further improved coding efficiency. The video coding system defined by ITU-T
Recommendation H.263 adopts the prediction method that codes a motion vector by using the median of the three
motion-vector values as its predicted value, which is well known to be more effective.
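
The median-based prediction can be sketched as below, taking the median separately for the horizontal and vertical components. The function names are hypothetical, and the handling of neighbors that are missing at frame borders is deliberately left out of this sketch.

```python
def median3(a, b, c):
    """Median of three scalar values."""
    return sorted((a, b, c))[1]


def predict_vector(left, above, above_right):
    """Component-wise median of the three neighboring motion vectors."""
    return (median3(left[0], above[0], above_right[0]),
            median3(left[1], above[1], above_right[1]))


def encode_vector(mv, left, above, above_right):
    """Difference between the motion vector and its predicted value."""
    px, py = predict_vector(left, above, above_right)
    return (mv[0] - px, mv[1] - py)


def decode_vector(diff, left, above, above_right):
    """Inverse of encode_vector: add the same prediction back."""
    px, py = predict_vector(left, above, above_right)
    return (diff[0] + px, diff[1] + py)
```

For example, with neighbors (2, -1), (4, 0) and (3, 5) the predicted value is (3, 0), so a motion vector (3, 1) is coded as the small difference (0, 1).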
The structure and operation of a motion-compensated interframe-predicting portion of a conventional
video-decoding device will now be described as follows:

The motion-compensated interframe predicting portion is composed of a fixed-area-size predicted-frame
generating portion and a motion-vector decoding portion.

Coded side-information inputted into the motion-compensated interframe predicting portion is transferred to the
motion-vector decoding portion, which in turn decodes the received coded side-information, obtains motion vectors and
outputs the motion vectors to the fixed-area-size predicted-frame generating portion.

The fixed-area-size predicted-frame generating portion performs interframe prediction processing by using the 
motion vectors received from the motion vector decoding portion and a reference video-frame received from the frame 
memory portion. 

Interframe prediction is to determine affine parameters from positions of three vertices of an objective triangular
area and motion vectors and then to perform affine transformation of all pixels within the triangular area by using the
determined affine parameters. This processing is repeated on all objective triangular areas to generate a predicted
video-frame.

The fixed-area-size predicted-frame generating portion outputs the obtained predicted video-frame to an adder
portion.

Some kinds of methods used for motion-compensated interframe prediction in the above-mentioned video-coding 
and video-decoding devices will be described below. 
A motion vector searching method is as follows: 

A so-called "pixel-matching" method that generally weights a center pixel in a processable area is used for
determining a motion vector of a control grid point. The processable area is composed of, for instance, 21 pixels each
in the X-direction and the Y-direction. In the pixel matching, an area of the reference frame which matches the
processable area of the current frame is determined by calculation, and then motion vectors are determined as
displacement values in the X-direction and the Y-direction. In the matching calculation, the difference between the
processable area of the current video-frame and the matched area of the reference frame is multiplied by a weighting
coefficient. By doing so, motion vectors are searched with a weight put on the center pixel in the processable area.

Motion vectors at control grid points located at the periphery of a video-frame are set as follows:

Motion vectors at the four corner control grid points:

X- and Y-components of each vector are both set at 0.

Motion vectors at the top and bottom control grid points:

An X-component is determined as the X-component of the motion vector detected at the control grid point one
inside from the objective control grid point. A Y-component is set at 0.

Motion vectors at the left-side and right-side control grid points:

An X-component of each motion vector is set at 0. A Y-component is determined as the Y-component of the motion
vector detected at the control grid point one inside from the objective control grid point.
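
The three peripheral rules can be sketched on a grid of already detected vectors. This sketch assumes the first rule applies to the four corner points of the control grid and that the grid has at least three rows and three columns; the function name and the grid layout (`grid[row][column]` holding an `(x, y)` tuple) are hypothetical.

```python
def set_border_vectors(grid):
    """Return a copy of `grid` with the peripheral motion vectors filled in."""
    rows, cols = len(grid), len(grid[0])
    out = [list(row) for row in grid]
    # Top and bottom edges: copy X from the point one inside, set Y to 0.
    for c in range(1, cols - 1):
        out[0][c] = (grid[1][c][0], 0)
        out[rows - 1][c] = (grid[rows - 2][c][0], 0)
    # Left and right edges: set X to 0, copy Y from the point one inside.
    for r in range(1, rows - 1):
        out[r][0] = (0, grid[r][1][1])
        out[r][cols - 1] = (0, grid[r][cols - 2][1])
    # Corner points (assumed reading of the first rule): both components 0.
    for r in (0, rows - 1):
        for c in (0, cols - 1):
            out[r][c] = (0, 0)
    return out
```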
The affine transformation is described as follows: 

The affine transformation is conducted by representing a map from a video-frame to another video-frame by using 
six parameters. 

The affine transformation is usually conducted on a triangular area, for the sake of simplifying the calculation of
affine parameters.

Motion vectors of control grid points A, B, C and D of the current video-frame are detected at corresponding control
grid points A', B', C' and D' of a reference video-frame.

Three of the four control grid points are first selected and the area is divided to determine affine parameters. For
example, the area on the current video-frame is divided into two triangles ABC and BCD, and the corresponding area on
the reference video-frame is divided into two triangles A'B'C' and B'C'D'.

On the triangles into which the area is divided, affine parameters are determined from vertex positions of each
triangle (vertex positions of one triangle and motion vectors of the other may be used).

A predictive video-frame is generated by mapping thereto all pixels of all triangular sub-areas according to the 
obtained affine parameters. 
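
As an illustration of how the six parameters follow from three vertex correspondences, the sketch below solves x' = a*x + b*y + c and y' = d*x + e*y + f by Cramer's rule. The parameter names a to f and the function names are assumptions of this example, not notation taken from the specification.

```python
def affine_from_triangle(src, dst):
    """Six affine parameters mapping triangle `src` onto triangle `dst`.

    src, dst : three (x, y) vertices each, e.g. ABC in the current frame and
               A'B'C' in the reference frame (vertex plus motion vector).
    """
    (x1, y1), (x2, y2), (x3, y3) = src
    det = x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)
    if det == 0:
        raise ValueError("the three vertices are collinear")

    def solve(u1, u2, u3):
        # Cramer's rule for one output coordinate u = p*x + q*y + r.
        p = (u1 * (y2 - y3) - y1 * (u2 - u3) + (u2 * y3 - u3 * y2)) / det
        q = (x1 * (u2 - u3) - u1 * (x2 - x3) + (x2 * u3 - x3 * u2)) / det
        r = (x1 * (y2 * u3 - y3 * u2) - y1 * (x2 * u3 - x3 * u2)
             + u1 * (x2 * y3 - x3 * y2)) / det
        return p, q, r

    a, b, c = solve(dst[0][0], dst[1][0], dst[2][0])
    d, e, f = solve(dst[0][1], dst[1][1], dst[2][1])
    return a, b, c, d, e, f


def apply_affine(params, x, y):
    """Map a pixel position (x, y) with the six affine parameters."""
    a, b, c, d, e, f = params
    return (a * x + b * y + c, d * x + e * y + f)
```

When all three vertices carry the same motion vector, the solution reduces to a pure translation (a = e = 1, b = d = 0), which matches the block-displacement case.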

If a position of any referred pixel in the reference video-frame is not an integer, a predicted value is bilinearly 
interpolated to determine a pixel value of the predicted video-frame. 
A predicted video-frame is generated by performing the above-mentioned processing operations.

The motion-compensated interframe prediction according to the above-mentioned prior art using affine transformation
of a fixed area-size has the drawback that the efficiency of motion prediction is decreased when an object
to be mapped has an insufficient size or its periphery is divided by a plurality of areas, creating a considerable
difference between the object edge and the area dividing lines.

The motion-compensated interframe prediction with the affine transformation may be accompanied by geometrical
distortion peculiar to the affine transformation, with the result that the prediction efficiency is decreased and the
quality of coded video frames deteriorates. The motion-compensated interframe prediction may also suffer a considerable
decrease of the prediction efficiency if an input video contains a motion of an objective area by a parallel
displacement that cannot be correctly represented.

High accuracy of setting motion vectors of control-grid points at the peripheral part of a video-frame may be
impossible because they are set affinely from respective inside points of the control grid.

In the case of conducting affine transformation of a variable-size area, the number of control grid points must
be increased, and therefore the number of motion vectors is increased as compared with the conventional affine
transformation of a fixed-size area. Accordingly, motion vectors must be encoded at a high efficiency.

In the affine transformation of a variable-size area, the number of control grid points varies with the size of a
selected area, i.e., the number of motion vectors varies with the number of the control grid points. Consequently,
motion vectors may be arranged dispersively in the video-frame. This makes it difficult to apply the conventional
motion-vector coding method as it is, and the coding efficiency is also decreased. It is therefore necessary to use a
highly efficient motion-vector coding system adapted to variable area-size affine transformation.

Another operation of a motion-compensated interframe predicting portion in a video-coding device and a
video-decoding device and methods applied therein will be described below.

An exemplified structure of a motion-compensated interframe predicting portion and its operation will be first de- 
scribed as follows: 

The motion-compensated interframe-predicting portion comprises a motion-vector searching portion, a
predicted-frame generating portion and a motion-vector coding portion.

The motion-vector searching portion searches motion vectors in an input video-frame and a reference video-frame 
read from a frame memory portion and outputs the detected motion-vectors to the predicted frame generating portion. 

The predicted frame generating portion generates a predictive image by using any one of several prediction methods,
e.g., a block-displacement method, an affine transformation method or a bilinear transformation method.

In the process of generating a predicted video-frame, side-information such as area-information and motion vectors
can be obtained.

The generated predicted video-frame is transferred to the subtracting portion and the adder portion while the side- 
information is transferred to the side-information coding portion. 

The side-information coding portion encodes the side-information received from the predicted-frame generating
portion and outputs the coded side-information.

Another structure and another operation of the motion-compensated interframe-predicting portion of a conventional
video-decoding device will now be described as follows:

The motion-compensated interframe predicting portion is composed of a predicted-frame generating portion and
a side-information decoding portion.

Coded side-information inputted into the motion-compensated interframe predicting portion is transferred to the
side-information decoding portion.

The side-information decoding portion decodes the received coded side-information, obtains decoded
side-information including area-information, motion vectors, etc., and outputs the decoded side-information to the
predicted-frame generating portion.

The predicted-frame generating portion performs interframe-prediction processing by using the side-information
received from the side-information decoding portion and the reference video-frame received from the frame memory
portion and outputs the produced predicted video-frame to the adder portion.

The prior art uses a fixed method for motion-compensated interframe prediction. Therefore, a conventional device
using affine transformation of a fixed area-size cannot be adapted to the case when the total number of codes could be
further reduced by using the block displacement prediction.

The motion-compensated interframe prediction with the affine transformation may not always be performed with
sufficient prediction efficiency, depending on the shape, size and position of the object to be mapped.



SUMMARY OF THE INVENTION 

The present invention relates to a video coding device for encoding video information by reducing the volume of
information contained in a video signal and a video decoding device for decoding coded information into a video signal
and, more particularly, to a coding device for conducting interframe prediction per pixel by determining a motion vector
from a representative motion vector according to a motion-vector-interpolation predicting method, to a video coding
device and a video decoding device, both of which use a motion-compensated interframe prediction method applying
affine transformation, and to video-coding and video-decoding devices which are provided with an adaptive
motion-compensated interframe prediction system.

The present invention was made to provide a video coding device and a video decoding device, which can achieve
high efficiency of interframe prediction and coding of video signals by accurately determining pixel motion vectors with
a minimized effect of motion vectors irrelevant to each remarked pixel by previously weighting the representative motion
vectors to be used for determining therefrom motion vectors for respective pixels.

The present invention was made to provide a video-coding and decoding device which is capable of determining
accurate motion vectors between successive video-frames, dividing each frame into areas optimal to the size and
shape of an object in the video-frame, efficiently encoding motion vectors, decoding the efficiently encoded motion
vectors and reproducing a predicted video-frame obtained by predicting with the frame areas optimally divided in
accordance with the size and shape of the object in the frame. The global object of the present invention is to realize
highly efficient motion-compensated interframe prediction of video-frames and communication of video-information by
using communication lines of a lower bit-rate.
The present Invention provides: 

(1) a video coding device comprising frame memory means for storing already encoded video signals, motion
vector detecting means for determining a representative motion vector per unit-area from a difference between an
input video signal and a video signal read from the frame memory means, motion vector interpolating means for
determining a motion vector per pixel from the representative motion vector, pixel value predicting means for
preparing a predicted video signal from a video signal read from the frame memory means by using the motion vector
per pixel, wherein weighting coefficient control means for instructing the vector interpolating means to weight a
representative motion vector is provided for determining a weighting coefficient for each representative motion
vector determined by the motion vector detecting means and instructing the motion vector interpolating means to
weight each representative motion vector;

(2) a video-coding device as defined in item (1), characterized in that the weighting coefficient control means
selects one of previously prepared patterns of weighting coefficients for representative vectors and instructs the
motion vector interpolating means;

(3) a video-coding device as defined in any one of items (1) and (2), characterized in that the weighting coefficient
control means determines a vector weighting coefficient according to a direction of each representative motion
vector; and

(4) a video-coding device as defined in any one of items (1) and (2), characterized in that the weighting coefficient
control means determines a vector weighting coefficient according to a vector value of each representative motion
vector;



(5) a video decoding device comprising frame memory means for storing already decoded video signals, vector
interpolating means for determining a motion vector per pixel from a representative motion vector inputted per unit
area, pixel value predicting means for preparing a predicted video signal from a video signal read from the frame
memory means by using the motion vector per pixel, wherein weighting coefficient control means for instructing
the vector interpolating means to weight a representative motion vector is provided for determining a weighting
coefficient for each representative motion vector and instructing the motion vector interpolating means to weight each
representative motion vector;

(6) a video-decoding device as defined in item (5), characterized in that the weighting coefficient control means
selects one of previously prepared patterns of weighting coefficients for representative vectors and instructs the
motion vector interpolating means; and

(7) a video-decoding device as defined in any one of items (5) and (6), characterized in that motion vector converting
means is provided for determining a motion vector for a skipped frame from a representative motion vector inputted
for a unit area of an encoded image frame (screenful) and an interpolating video signal corresponding to an image
frame thinned-out without having been encoded is prepared.



In the above-mentioned video-coding device, the frame memory portion stores already coded video-signals as a
reference video frame to be used for interframe prediction. The motion vector detecting portion receives an input video
frame signal to be encoded, reads a reference video frame stored in the frame memory and searches said
reference frame for the area most similar to the current processable unit-area of the input video frame.
The displacement of the found similar area in the reference frame from the processable unit-area is outputted as a
motion vector. The weighting coefficient control portion determines a weighting coefficient for each motion vector of
the unit-area and informs the motion vector interpolating portion of the weighting coefficients specified for the respective
motion vectors. The weighting coefficients are determined in such a way that the interfering effect of
representative motion vectors representing different motions of different objects in an image is reduced enough to
obtain a correct displacement of the current remarked pixel. The motion vector interpolating portion determines motion
vectors of the respective pixels by using the motion vectors outputted from the motion vector detecting portion and the
weighting coefficients specified by the weighting coefficient control portion and outputs the calculated results. Here,
combining the devised method for weighting each motion vector with the affine/bilinear transformation technique
determines the motion vector of each pixel more accurately than the conventional method, attaining a
higher efficiency of interframe prediction and video-signal coding. In the pixel value predicting portion, the motion
vectors of the respective pixels outputted from the motion vector interpolating portion are used as displacement values
of the respective pixels from the corresponding pixels of the reference frame, and the corresponding pixel values are
read from the frame memory to compose a predicted frame.
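
The per-pixel interpolation described above can be sketched as follows. This is a minimal illustration, not the method fixed by the specification: it assumes four representative motion vectors at the corners of a square unit-area, bilinear position factors, and a simple rule that multiplies each position factor by the weighting coefficient and renormalizes. The function name `interpolate_motion_vector` and its parameters are hypothetical.

```python
def interpolate_motion_vector(x, y, size, rep_vectors, weights):
    """Return the (dx, dy) motion vector of the pixel at (x, y) in a unit-area.

    rep_vectors: four representative vectors ordered
                 [top-left, top-right, bottom-left, bottom-right].
    weights:     one weighting coefficient per representative vector
                 (assumed to come from a weighting coefficient control step).
    """
    u = x / size
    v = y / size
    # Ordinary bilinear position factors for the four corners.
    bilinear = [(1 - u) * (1 - v), u * (1 - v), (1 - u) * v, u * v]
    # Assumption: each position factor is scaled by its weighting coefficient.
    combined = [b * w for b, w in zip(bilinear, weights)]
    total = sum(combined)
    if total == 0:
        # All contributing vectors were suppressed; fall back to plain bilinear.
        combined = bilinear
        total = sum(bilinear)
    mv_x = sum(c * mv[0] for c, mv in zip(combined, rep_vectors)) / total
    mv_y = sum(c * mv[1] for c, mv in zip(combined, rep_vectors)) / total
    return mv_x, mv_y


# Three corners follow one object, the fourth follows another object.
reps = [(4, 0), (4, 0), (4, 0), (-12, 8)]
print(interpolate_motion_vector(8, 8, 16, reps, [1, 1, 1, 1]))  # unweighted
print(interpolate_motion_vector(8, 8, 16, reps, [1, 1, 1, 0]))  # outlier suppressed
```

With equal weights the centre pixel is pulled toward the unrelated fourth vector; giving that vector a zero weight leaves the pixel with the motion shared by the other three, which is the interfering effect the weighting is meant to reduce.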

In the weighting coefficient control portion, it is also possible to prepare several patterns of weighting
coefficients in advance, select a suitable one of the patterns and give only information about the selected pattern,
instead of directly indicating the values, to the motion vector interpolating portion.

Furthermore, the weighting coefficient control portion may also determine a weighting coefficient according to only
the direction of each representative motion vector in such a way that a motion vector indicating a particular direction is
weighted with a small weight or, if there is a large dispersion of directions of motion vectors, only the motion vectors of
the nearest neighbors to a remarked pixel are weighted with a larger weight. This method makes it easy to determine
correct weighting coefficients, improving the efficiency of interframe prediction.

The weighting coefficient control portion may also determine a vector weighting coefficient according to a vector 
value of each representative motion vector by discriminating a dispersion of motion vectors and separating a motion 
vector that has a particular value representing the motion of another object in an image. This is effective to obtain 
adequate weighting coefficients for determining accurate motion vectors of respective pixels. The efficiency of inter- 
frame prediction can be improved. 
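
One way to picture weighting by vector value is the sketch below. It is an assumption-laden illustration only: the specification does not give the dispersion test, so the example measures each representative vector's deviation from the component-wise median and assigns a low weight when the deviation exceeds a threshold. The names `weights_from_vector_values`, `threshold` and `low_weight` are hypothetical.

```python
from statistics import median


def weights_from_vector_values(rep_vectors, threshold=4.0, low_weight=0.0):
    """Assign a weighting coefficient to each representative motion vector.

    Assumption: a vector whose city-block distance from the component-wise
    median exceeds `threshold` is treated as the motion of another object
    and receives `low_weight`; all other vectors receive 1.0.
    """
    med_x = median(v[0] for v in rep_vectors)
    med_y = median(v[1] for v in rep_vectors)
    weights = []
    for dx, dy in rep_vectors:
        deviation = abs(dx - med_x) + abs(dy - med_y)
        weights.append(low_weight if deviation > threshold else 1.0)
    return weights


# Three vectors agree; the fourth belongs to a differently moving object.
print(weights_from_vector_values([(4, 0), (4, 0), (4, 0), (-12, 8)]))
```

The resulting coefficients could then be handed to the motion vector interpolating step, so that the separated vector no longer disturbs the pixels that belong to the main object.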

Furthermore, the present invention provides: 

(8) A video coding device, which is used for encoding predicted-error information representing a difference between
an input video-frame and a predicted video-frame obtained by performing motion-compensated interframe prediction,
and whose motion-compensated interframe predicting portion is provided with a variable-area predicted-frame
generating portion for dividing a processable area of a video-frame into suitable areas according to motion vectors
and a reference video-frame and generating a predicted frame by using affine transformation and an area-dividing
pattern deciding portion for controlling dividing of a processable area and outputting a predicted video-frame and
side-information such as motion vector information and area-dividing information;

(9) A video-coding device, which is used for encoding predicted-error information representing a difference between
an input video-frame and a predicted video-frame obtained by performing motion-compensated interframe prediction,
and whose motion-compensated interframe predicting portion is provided with a variable-area predicted-frame
generating portion for generating a predicted video-frame by translational displacement of a processable area and
dividing a processable area of a video-frame into suitable areas according to motion vectors and a reference
video-frame and generating a predicted frame by using affine transformation and an area-dividing pattern deciding portion
for controlling dividing of a processable area and outputting a predicted video-frame and side-information such as
motion vector information and area-dividing information;

(10) A video coding device, which is used for encoding predicted-error information determined as a difference
between an input video-frame and a predicted video-frame obtained by executing motion-compensated interframe
prediction, and whose motion-compensated interframe-predicting portion is provided with an effective-area
selecting portion for selecting a valid processing mask or an invalid processing mask depending upon location of an
objective area in a video-frame when searching a motion vector;

(11) A video coding device, which has the same construction as defined in any one of (8) and (9) above and is used
for encoding predicted-error information determined as a difference between an input video-frame and a predicted
video-frame obtained by executing motion-compensated interframe prediction, and whose motion-compensated
interframe-predicting portion is further provided with an effective-area selecting portion for selecting a valid
processing mask or an invalid processing mask depending upon location of an objective area in a video-frame when
searching a motion vector;

(12) A video coding device, which is used for encoding predicted error information determined as a difference
between an input video-frame and a predicted video-frame obtained by executing motion-compensated interframe
prediction, and whose motion-vector coding portion is provided with a side-information coding portion for encoding
an additional motion vector as a difference from a mean of four basic motion-vectors;

(13) A video coding device, which has the same construction as defined in any one of (8) and (9) above and wherein
the motion-vector coding portion of the area-dividing pattern deciding portion is further provided with a
side-information coding portion for encoding an additional motion vector as a difference from a mean of four basic
motion-vectors;

(14) A video coding device, which is used for encoding predicted error information determined as a difference
between an input video-frame and a predicted video-frame obtained by executing motion-compensated interframe
prediction, wherein a motion-vector coding portion is provided with a side-information coding portion for encoding
an additional motion vector positioned between two basic motion-vectors as a difference from a mean of the two
basic motion-vectors and encoding a center additional motion-vector as a difference from a mean of four basic
motion-vectors;

(15) A video coding device, which has the same construction as defined in any one of (8) and (9) above, wherein
the motion-vector coding portion of the area-dividing pattern deciding portion is further provided with a
side-information coding portion for encoding an additional motion vector positioned between two basic motion-vectors as a
difference from a mean of the two basic motion-vectors and encoding a center additional motion-vector as a
difference from a mean of four basic motion-vectors;

(16) A video coding device, which is used for encoding predicted error information determined as a difference
between an input video-frame and a predicted video-frame obtained by executing motion-compensated interframe
prediction, and whose motion-vector coding portion is provided with a side-information coding portion for encoding
an objective motion-vector value by prediction from values of three motion-vectors which are already encoded
basic or additional motion-vectors existing at the left side, the just upper side and the upper right side of the objective
motion vector;

(17) A video coding device, which has the same construction as defined in (16) above, and whose motion-vector
coding portion determines a mean of the three motion vectors as a predicted value of the objective motion-vector
and encodes a difference between the objective motion-vector and the predicted value;

(18) A video coding device, which has the same construction as defined in (16) above, and whose motion-vector 
coding portion determines a median of the three motion vectors as a predicted value of the objective motion-vector 
and encodes a difference between the objective motion-vector and the predicted value; 

(19) A video coding device, which has the same construction as defined in any one of (8) and (9) above, and whose
area-dividing pattern deciding portion is further provided with the motion-vector coding portion defined in any one
of (16), (17) and (18) above;

(20) A video coding device, which has the same construction as defined in any one of (8), (9), (11), (13), (15) and
(19) above, and whose motion-compensated interframe-predicting portion is further provided with an area-dividing
pattern deciding portion for instructing kinds of dividing patterns of all areas to the variable-area predicted-frame
generating portion, adopting an area-dividing pattern minimizing a predicted-error (error information value),
outputting motion-vectors and area-dividing information to the side-information coding portion and outputting a predicted
video-frame;

(21) A video coding device, which has the same construction as defined in any one of (8), (9), (11), (13), (15) and
(19) above, and whose motion-compensated interframe-predicting portion is further provided with an area-dividing
deciding portion which instructs the variable-area predicted-frame generating portion to divide an area into two
small areas as an initial setting and/or make prediction by translational displacement, instructs the variable-area
predicted-frame generating portion to more finely divide the area again if a predicted error (error information value)
of a predicted-frame generated with the initial setting exceeds a preset threshold value, and which deciding portion
outputs motion-vectors and area-dividing information to the side-information coding portion when the prediction
error (error information value) becomes smaller than the preset threshold value, and also outputs a predicted
video-frame;

(22) A video coding device, which has the same construction as defined in any one of (8), (9), (11), (20) and
(21) above and is further provided with a subtracting portion whereto the input video frame and the predicted
video-frame from the motion-compensated interframe-predicting portion are transferred, a video coding portion for
encoding an error (differential) video-frame from the subtracting portion according to an instruction of the coding
control portion, a video decoding portion for decoding a coded video frame from the video coding portion, an adder
portion for adding a predicted video-frame from the motion-compensated interframe-predicting portion to a decoded
video-frame from the video decoding portion and a frame memory for storing a reference video-frame from the
adder portion and outputting stored information to the motion-compensated interframe-predicting portion;

(23) A video decoding device, which is used for reproducing a video-frame from coded side-information inputted
from a video-coding device and a reference video-frame inputted from a frame memory, wherein a
motion-compensated interframe-predicting portion for generating and outputting a predicted video-frame by changing a
codable area-size is provided with a side-information decoding portion for decoding coded side-information from the
video coding device and obtaining thereby motion-vectors and area-dividing information and a variable-area
predicted video-frame generating portion for generating a predicted video-frame by using motion vectors and
area-dividing information from the side-information decoding portion and a reference video-frame from the frame
memory;

(24) A video decoding device, which is used for reproducing a video-frame from coded side-information inputted
from a video-coding device and a reference video-frame inputted from a frame memory, and which is further
provided with a side-information decoding portion for decoding four basic motion-vectors contained in the
side-information before decoding motion-vectors and for decoding additional motion-vectors representing a difference from
a mean of the basic vectors;

(25) A video decoding device, which has the same construction as defined in (16) above and whose side-information
decoding portion decodes four basic motion-vectors contained in the side-information before decoding
motion-vectors and decodes additional motion-vectors representing a difference from a mean of the basic vectors;

(26) A video decoding device, which is used for reproducing a video-frame from coded side-information inputted
from a video-coding device and a reference video-frame inputted from a frame memory, wherein a side-information
decoding portion is provided for decoding four basic motion-vectors contained in the side-information before
decoding motion-vectors and for decoding an additional motion-vector positioned between two basic motion-vectors
as a difference from an obtained mean of the two basic motion-vectors and decoding a center additional
motion-vector as a difference from an obtained mean of four basic motion-vectors;

(27) A video decoding device, which has the same construction as defined in (16) above, and wherein a
side-information decoding portion decodes four basic motion-vectors contained in the side-information before decoding
motion-vectors, decodes an additional motion-vector positioned between two basic motion-vectors as a
difference from an obtained mean of the two basic motion-vectors and decodes a center additional motion-vector as
a difference from an obtained mean of four basic motion-vectors;

(28) A video decoding device, which is used for reproducing a video-frame from coded side-information inputted
from a video-coding device and a reference video-frame inputted from a frame memory, and which is provided
with a side-information decoding portion which decodes an objective motion-vector value by prediction from values
of three motion-vectors which are already decoded basic or additional motion-vectors existing at the left side, the
just upper side and the upper right side of the objective motion vector;

(29) A video decoding device, which has the same construction as defined in (28) above, and whose side-information
decoding portion determines a mean of the three motion-vectors as a predicted value of the objective
motion-vector and obtains a decoded objective motion-vector value by adding the decoded difference value to
the predicted value;

(30) A video decoding device, which has the same construction as defined in (28) above, and whose side-information
decoding portion determines a median of the three motion vectors as a predicted value of the objective
motion-vector and obtains a decoded objective motion-vector value by adding the decoded difference value to the
predicted value;

(31) A video decoding device, which is as defined in (23), and whose side-information decoding portion is further
provided with the motion-vector decoding system as defined in any one of (28), (29) and (30) above;

(32) A video decoding device, which has the same construction as defined in any one of (23), (25), (27) and (31)
above, and which is further provided with a video decoding portion for decoding coded video-information inputted
from a video coding device and outputting the decoded video-information, an adder portion for adding a decoded
video-frame inputted from the video decoding portion to a predicted video-frame inputted from the
motion-compensated interframe predicting portion and outputting an obtained video-frame as an output video-frame and a
frame memory for storing a video-frame from the adder portion and for outputting said stored video-frame as a
reference video-frame to the motion-compensated interframe predicting portion.

(33) a video coding device using adaptive motion-compensated interframe prediction, which comprises a predicting
portion for generating a plurality of predictive images of small variable-size areas by applying different predicting
methods to each of the small variable-size areas in the process of the motion-compensated interframe prediction
and outputting the generated predictive images, an area prediction deciding portion for determining an adaptive
area size and an adaptive prediction method according to a plurality of the predictive images received from the
predicting portion and outputting side-information composed of area-information, prediction-mode information,
motion-vectors and so on, and a side-information coding portion for encoding the side-information outputted from the
area prediction deciding portion;

(34) a video coding device using adaptive motion-compensated interframe prediction, which comprises a predicting
portion for generating a plurality of predictive images of respective unit areas or further-divided subareas by
applying different predicting methods to each of the unit areas or each of the subareas in the process of the
motion-compensated interframe prediction and then outputting the generated predictive images, an area prediction
deciding portion for determining area-information indicating whether the predicted images received from the
predicting portion are unit-area images or subarea images and prediction methods applied to the respective
predicted images and outputting side-information including area-information, prediction mode information,
motion-vectors and the like, and a side-information coding portion for encoding the side-information received from the
area prediction deciding portion;

(35) a video coding device using adaptive motion-compensated interframe prediction, which comprises a predicting
portion for generating a plurality of predictive images of respective variable-size small areas by using a
block-displacement (overlapped motion-compensative) predicting method, an affine transform predicting method, a
bilinear transform predicting method and a background predicting method, etc. in the process of the
motion-compensated interframe prediction and then outputting the generated predictive images, an area prediction deciding portion
for determining an adaptive area-size and an adaptive prediction method from the predictive images received
from the predicting portion and outputting side-information including area-information, prediction mode information
and motion-vectors, etc., and a side-information coding portion for encoding the side-information received from
the area prediction deciding portion;

(36) a video coding device using adaptive motion-compensated interframe prediction, which is provided with a
predicting portion for diagonally dividing each encodable unit-area into two or four subareas, producing a predictive
image for each subarea by affine transformation in the process of the motion-compensated interframe prediction
and outputting the produced predicted images;

(37) a video coding device using adaptive motion-compensated interframe prediction, which has the same functions
as defined in any one of items (33), (34) and (35), and is provided with a predicting portion for diagonally dividing
each encodable unit-area into two or four subareas, producing a predictive image for each subarea by affine
transformation in the process of the motion-compensated interframe prediction and outputting the produced predicted
images;

(38) a video decoding device using adaptive motion-compensated interframe prediction, which comprises a
side-information decoding portion for decoding coded side-information including area-information, prediction mode
information and motion-vectors, etc. and outputting the decoded side-information, a predicting portion for generating
a plurality of predictive images of small variable-size areas by applying different predicting methods to each of the
small variable-size areas in the process of the motion-compensated interframe prediction and outputting the
generated predictive images, and an area-prediction-mode selecting portion for generating an adaptive predictive
image from a plurality of the predictive images received from the predicting portion according to the information
received from the side-information decoding portion;

(39) a video decoding device using adaptive motion-compensated interframe prediction, which comprises a
side-information decoding portion for decoding coded side-information including area-information, prediction mode
information and motion-vectors, etc. and outputting the decoded side-information, a predicting portion for generating
a plurality of predictive images of respective decodable unit-areas or further-divided subareas by applying different
predicting methods to each of the unit areas or each of the subareas in the process of the motion-compensated
interframe prediction and then outputting the generated predictive images, and an area-prediction-mode selecting
portion for generating an adaptive predictive image from a plurality of the predictive images received from the
predicting portion according to the information received from the side-information decoding portion;

(40) a video decoding device using adaptive motion-compensated interframe prediction, which comprises a
side-information decoding portion for decoding coded side-information including area-information, prediction mode
information and motion-vectors, etc. and outputting the decoded side-information, a predicting portion for generating
a plurality of predictive images of respective decodable unit-areas or further-divided subareas by using a
block-displacement (overlapped motion-compensative) predicting method, an affine transform predicting method, a
bilinear transform predicting method and a background predicting method, etc. in the process of the
motion-compensated interframe prediction and then outputting the generated predictive images, and
an area-prediction-mode selecting portion for generating an adaptive predictive image from a plurality of the
predictive images received from the predicting portion according to the information received from the side-information
decoding portion;

(41) a video decoding device using adaptive motion-compensated interframe prediction, which is provided with a
predicting portion for diagonally dividing each decodable unit-area into two or four subareas, producing a predictive
image for each subarea by affine transformation in the process of the motion-compensated interframe prediction
and outputting the produced predicted images; and

(42) a video decoding device using adaptive motion-compensated interframe prediction, which has the same
functions as defined in any one of items (39) and (40) and is provided with a predicting portion for diagonally dividing
each decodable unit-area into two or four subareas, producing a predictive image for each subarea by affine
transformation in the process of the motion-compensated interframe prediction and
outputting the produced predicted images.
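
Items (16) to (18) and (28) to (30) above describe predictive coding of a motion vector from three already processed neighbors (left, just above, upper right). The sketch below illustrates the median variant under stated assumptions: the median is taken separately for the horizontal and vertical components, which the text does not specify, and the function names are hypothetical.

```python
def median_of_three(a, b, c):
    """Return the middle value of three numbers."""
    return sorted((a, b, c))[1]


def predict_vector(left, up, up_right):
    """Predict a motion vector as the component-wise median of three neighbors.

    Assumption: the median is applied to each component independently.
    """
    return (median_of_three(left[0], up[0], up_right[0]),
            median_of_three(left[1], up[1], up_right[1]))


def encode_vector(mv, left, up, up_right):
    """Encoder side: return the difference between mv and its prediction."""
    px, py = predict_vector(left, up, up_right)
    return (mv[0] - px, mv[1] - py)


def decode_vector(diff, left, up, up_right):
    """Decoder side: add the decoded difference to the same prediction."""
    px, py = predict_vector(left, up, up_right)
    return (diff[0] + px, diff[1] + py)


left, up, up_right = (2, -1), (3, 0), (10, 4)
diff = encode_vector((4, 1), left, up, up_right)
print(diff)
print(decode_vector(diff, left, up, up_right))
```

Because the encoder and decoder form the prediction from the same already processed neighbors, only the small difference needs to be transmitted. Replacing the median with an arithmetic mean would give the variant of items (17) and (29).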

The above-mentioned video-coding and video decoding devices can selectively use any adaptive one of two or 
more methods for motion-compensated interframe prediction, which may minimize the number of codes for represent- 
ing side-information (e.g., motion vectors) or a prediction error. 
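
The adaptive selection among prediction methods can be pictured with the following sketch. It is an illustration under assumptions, not the decision rule of the specification: the cost is taken as the sum of absolute differences plus a weighted side-information bit count, and the names `sad`, `select_prediction` and `lam` are hypothetical.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized 2-D blocks."""
    return sum(abs(a - b)
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b))


def select_prediction(input_block, candidates, lam=1.0):
    """Pick the prediction method with the lowest combined cost.

    candidates: dict mapping a method name to (predicted_block, side_info_bits).
    Assumed cost: prediction error (SAD) + lam * side-information bits.
    """
    best_name, best_cost = None, None
    for name, (predicted, bits) in candidates.items():
        cost = sad(input_block, predicted) + lam * bits
        if best_cost is None or cost < best_cost:
            best_name, best_cost = name, cost
    return best_name, best_cost


block = [[10, 12], [14, 16]]
candidates = {
    "block_displacement": ([[10, 12], [14, 15]], 8),   # one vector, small error
    "affine": ([[10, 12], [14, 16]], 24),              # three vectors, no error
}
print(select_prediction(block, candidates, lam=1.0))
print(select_prediction(block, candidates, lam=0.0))
```

With side-information taken into account, the cheaper block-displacement candidate wins despite its small error; when only the prediction error counts, the affine candidate is chosen.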

In the motion-compensated interframe-prediction by using the affine-transformation technique, it is possible to 
reduce the amount of side-information (e.g., motion vectors) or prediction error in conformity with the shape, size and 
location of an object to be mapped. This could not be realized by the conventional method. 

Owing to these advantages, the video-coding and video-decoding devices according to the present invention can
achieve a more effective motion-compensated interframe prediction with an increased efficiency of video-coding and
decoding. Application of the video-coding and video-decoding devices according to the present invention can realize
video-communications over transmission lines and channels whose transmission bit-rate is very low.



BRIEF DESCRIPTION OF THE DRAWINGS 



Fig. 1 is a block diagram for explaining the principle of a motion compensated interframe prediction with orthogonal 
transform encoding method. 

Fig. 2 is a simplified block diagram showing an exemplified structure of an interframe predicting portion of a con- 
ventional encoding device. 

Fig. 3 is a view showing the correlation between a unit area to be transformed, a representative point and a rep- 
resentative motion vector. 

Fig. 4 is a simplified block diagram showing an exemplified structure of an interframe predicting portion of a con- 
ventional encoding device. 

Fig. 5 is a schematic illustration of a whole system of a video coding device according to a prior art or the present 
invention. 

Fig. 6 is a schematic illustration of a whole system of a video decoding device according to a prior art or the present 
invention. 

Fig. 7 is illustrative of a motion-compensated interframe predicting portion of a conventional video coding device.

Fig. 8 is illustrative of a motion-compensated interframe predicting portion of a conventional video decoding device.

Fig. 9 is a view for explaining a weighting method used when searching a motion vector according to the present
invention.

Fig. 10 is a view for explaining a control grid and a motion vector according to a prior art. 

Fig. 11 is a view for explaining interframe prediction using affine transformation in case of forward interframe
prediction of deformation of an object.

Fig. 12 shows an exemplified structure of a motion-compensated interframe-predicting portion of a conventional
video coding device.

Fig. 13 shows an exemplified structure of a motion-compensated interframe-predicting portion of a conventional
video decoding device.

Fig. 14 is a simplified block diagram of a video coding device embodying the present invention. 

Fig. 15 is a flow chart showing a first example of operations of a weighting coefficient control portion of a video 
coding device according to the present invention. 

Fig. 16 is a flow chart showing a second example of operations of a weighting coefficient control portion of a video
coding device according to the present invention.

Fig. 17 is a simplified block diagram of a video decoding device embodying the present invention. 

Fig. 18 is a simplified block diagram showing an exemplified structure of a video decoding device according to the 
present invention. 

Fig. 19 shows an exemplified structure of a motion-compensated interframe predicting portion of a video coding 
device embodying the present invention. 

Fig. 20 shows all kinds of areas divided by a triangle according to the present invention. 

Fig. 21 shows an exemplified structure of a motion-compensated interframe predicting portion of a video decoding
device embodying the present invention.

Fig. 22 is a view for explaining a control grid and a motion vector according to the present invention. 

Fig. 23 shows all kinds of effective-area selecting masks according to the present invention. 

Fig. 24 is a view for explaining a motion-vector predictive coding method according to the present invention.

Fig. 25 is a flow chart showing a first example of processing operations of motion-compensated interframe
prediction according to the present invention.

Fig. 26 is a flow chart showing a second example of processing operations of motion-compensated interframe
prediction according to the present invention.

Fig. 27 shows an exemplified structure of a motion-compensated interframe-predicting portion of a video coding
device according to the present invention.

Fig. 28 shows an exemplified structure of a motion-compensated interframe-predicting portion of a video decoding 
device according to the present invention. 

Fig. 29 is a view for explaining an example of selecting a prediction method for a processable area.

Fig. 30 is a view for explaining an example of dividing a processable area.

Fig. 31 is a view showing patterns for dividing a processable area into triangles.

PREFERRED EMBODIMENT OF THE INVENTION

To begin with, prior art will be explained as a reference for the present invention.

A motion compensative interframe prediction with orthogonal transform encoding method that has been preferably
used first reduces the temporal redundancy of information through the interframe prediction. Fig. 1 shows
the principle of the motion compensative interframe prediction with orthogonal transform encoding method. The motion
compensative interframe predicting portion 1191 prepares a predicted value of an input video signal from an already
encoded signal stored in a frame memory 1193 and outputs a difference between the predicted value and the input
video signal as a prediction error signal. A prediction error signal encoding portion 1192 encodes the prediction error
signal by an orthogonal transforming method to further reduce redundancy of the information. The encoded prediction
error signal is locally decoded, then stored in the frame memory 1193, and is used for prediction of a succeeding frame.

In very low bit-rate video encoding, it is necessary to represent a video signal with a very small amount of
information. The amount of information allocated to the orthogonal transform encoding (i.e., prediction error encoding)
is therefore considerably limited. Accordingly, it is very important to improve the efficiency of the interframe prediction
by using an interframe prediction method that can more correctly predict a change of a video signal with time.

In recent years, many studies have been made on interframe prediction methods using affine transformation
and bilinear transformation. While the motion compensative interframe prediction method represents a movement
contained in a video as a translational motion by using a vector per unit-area, the method using the affine transformation
and bilinear transformation can represent a movement in a video more accurately, i.e., with increased prediction
efficiency, since it can represent movement, rotation, enlargement and deformation.

Fig. 2 is a schematic illustration of an interframe predicting portion of a conventional video-coding device that uses
affine transformation and bilinear transformation.

An interframe predicting portion of the conventional video-coding device comprises a frame memory portion 1131
for storing already coded video signals, a motion vector detecting portion 1132 for determining a representative motion
vector per unit area from a difference between an input video signal and a video signal read from the frame memory
portion 1131, a motion vector interpolating portion 1133 for determining a motion vector per pixel from the representative
motion vector, and a pixel value predicting portion 1134 for preparing a predicted video signal from a video signal read
from the frame memory portion 1131 by using the motion vector per pixel.

The operation of the above-mentioned interframe predicting portion is as follows:

The frame memory portion 1131 stores already coded video signals as reference video frames for interframe
prediction. The motion vector detecting portion 1132 receives an input video-frame signal to be encoded and reads a
reference video frame stored in the frame memory portion 1131. The motion vector detecting portion 1132 divides the
coding video frame into unit-areas and scans the reference video frame to find an area most similar to a current coding
unit-area. A displacement of a unit-area of the coding video-frame from the area found in the reference video frame is
outputted as a motion vector. The motion vector is a representative motion vector representing an interframe
displacement of a representative point within a unit-area (usually, the center of the unit-area). Fig. 3(a) shows an
example of the relationship between a representative point and the unit-area searched for its vector. In searching for a
similar area in a reference video frame, a sum of absolute differential values or a square-sum of differential values of
each pixel in a unit-area is used as a scale of similarity. Furthermore, a center portion of the unit-area may be weighted
by multiplying a differential pixel value at the center portion of the unit-area by a large coefficient and a differential
pixel value at the periphery thereof by a small coefficient prior to summation, so as to determine a displacement of the
representative point more accurately.
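The search described above can be sketched as a weighted sum-of-absolute-differences match. This is only an illustration under stated assumptions: the function names, the linear weight ramp, the search range and the sign convention (the returned offset points from the current unit-area to the matching area in the reference frame) are not taken from the specification.

```python
import numpy as np

def center_weights(size, center=2.0, edge=1.0):
    """Hypothetical weights: large near the middle of the unit-area, small at its border."""
    half = max((size - 1) / 2.0, 1.0)
    d = np.abs(np.arange(size) - (size - 1) / 2.0) / half  # 0 near the center, 1 at the border
    w1d = center + (edge - center) * d
    return np.outer(w1d, w1d)

def search_motion_vector(cur, ref, top, left, size, search=4):
    """Return ((dy, dx), cost) minimising the weighted sum of absolute differences."""
    block = cur[top:top + size, left:left + size].astype(np.float64)
    weights = center_weights(size)
    best, best_cost = (0, 0), np.inf
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            y, x = top + dy, left + dx
            # Skip candidate areas that fall outside the reference frame.
            if y < 0 or x < 0 or y + size > ref.shape[0] or x + size > ref.shape[1]:
                continue
            cand = ref[y:y + size, x:x + size].astype(np.float64)
            cost = np.sum(weights * np.abs(block - cand))
            if cost < best_cost:
                best, best_cost = (dy, dx), cost
    return best, best_cost
```

Replacing `np.abs(...)` with a squared difference gives the square-sum variant mentioned in the text.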

The representative motion vector is transferred to the motion vector interpolating portion 1133, which in turn
determines motion vectors for respective pixels using the received representative motion vectors. For affine
transformation, a motion vector for each pixel existing within a triangular area surrounded by three neighboring
representative points (hereinafter called transformable unit-area) is determined by solving an affine transformation
expression from the representative motion vectors of the respective representative points. For bilinear transformation,
a motion vector for each pixel existing within a quadrangular area surrounded by four neighboring representative points
(hereinafter called transformable unit-area) is determined by solving a bilinear transformation expression from the
representative motion vectors of the respective representative points. For a quadrangular or rectangular transformable
unit-area, this is equivalent to distributing the motion vector value of each representative point in the vertical and
horizontal directions in proportion to the distance between a remarked pixel and the representative point, as shown in
Fig. 3(b).
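For the rectangular case, the proportional distribution of the four representative motion vectors can be sketched as follows. The function name, the argument order and the normalised coordinates are illustrative assumptions, not part of the specification.

```python
import numpy as np

def bilinear_motion_field(mv_tl, mv_tr, mv_bl, mv_br, height, width):
    """Per-pixel motion vectors inside a rectangle whose four corners carry
    the representative motion vectors. Returns an array of shape (height, width, 2)."""
    tl, tr, bl, br = (np.asarray(v, dtype=np.float64) for v in (mv_tl, mv_tr, mv_bl, mv_br))
    # Normalised coordinates: 0 at the top/left corner, 1 at the bottom/right corner.
    v = np.linspace(0.0, 1.0, height)[:, None, None]
    u = np.linspace(0.0, 1.0, width)[None, :, None]
    # Each corner vector contributes in proportion to the pixel's closeness to that corner.
    return (1 - u) * (1 - v) * tl + u * (1 - v) * tr + (1 - u) * v * bl + u * v * br
```

At a corner the result equals that corner's representative vector, and at the center of the rectangle it is the average of the four.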

The pixel value predicting portion 1134 receives the motion vectors of the (remarked) pixels inputted pixel by pixel
and, considering each motion vector to be a displacement of the remarked pixel from its corresponding position in the
reference frame, reads a pixel value at the corresponding position from the frame memory 1131 as a predicted value of
the remarked pixel to compose a predicted frame. If the motion vector indicates a position in the reference frame where
no pixel exists, e.g., when the pixel motion vector (displacement) has a fractional part, neighboring pixel values in the
reference frame are read out from the frame memory and a predicted value of the remarked pixel is determined as an
interpolated value according to a known bilinear interpolation method.
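The bilinear interpolation of a pixel value at a fractional position can be sketched as below. The clamping at the frame border and the sign convention (the vector is added to the pixel position to reach the reference position) are assumptions made for the sake of a runnable example.

```python
import numpy as np

def sample_bilinear(ref, y, x):
    """Value of `ref` at fractional position (y, x), clamped to the frame."""
    h, w = ref.shape
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(np.floor(y)), int(np.floor(x))
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    fy, fx = y - y0, x - x0
    # Interpolate horizontally on the two neighbouring rows, then vertically.
    top = (1 - fx) * ref[y0, x0] + fx * ref[y0, x1]
    bottom = (1 - fx) * ref[y1, x0] + fx * ref[y1, x1]
    return (1 - fy) * top + fy * bottom

def predict_pixel(ref, y, x, mv):
    """Predicted value of pixel (y, x) given its motion vector mv = (dy, dx)."""
    return sample_bilinear(ref, y + mv[0], x + mv[1])
```

When the displaced position falls exactly on a pixel, the function returns that pixel's value unchanged.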

Fig. 4 shows a simplified structure of an interframe predicting portion of a conventional video-decoding device
using the affine and bilinear transformation methods.

The interframe predicting portion of the conventional decoding device comprises a frame memory portion 1141 for
storing already decoded video signals, a motion-vector interpolating portion 1142 for determining a motion vector of
each pixel from a representative motion vector inputted for each unit-area, and a pixel value predicting portion 1143 for
preparing a predicted video signal from a video signal read from the frame memory by using the pixel motion vectors.

The operation of the conventional video-coding device is first explained below. Fig. 5 shows an exemplified
entire system of the conventional video-coding device.

It is assumed that a reference video frame usable for producing a predictive video-frame is stored in a frame
memory portion 1216 in a normal operating condition of the conventional video-coding device for conducting the
motion-compensated interframe predictive coding.

An input video frame enters into a subtracting portion 1211 and a motion-compensated interframe-predicting portion
1217. The motion-compensated interframe-predicting portion 1217 determines a predictive motion from the reference
video-frame stored in the frame memory portion 1216 and the input video-frame and outputs a predicted video-frame
to the subtracting portion 1211.

The subtracting portion 1211 subtracts from the input video-frame the predicted video-frame entered by the
motion-compensated interframe predicting portion 1217 and outputs the subtraction result, i.e., predicted error
information, to a video coding portion 1212.

The video coding portion 1212 spatially transforms the input predicted-error-information, e.g., by using DCT
(Discrete Cosine Transform), quantizes the transformed information and outputs coded video-information. At the same
time, the coded video-information outputted from the video coding portion 1212 is locally decoded by a video-decoding
portion 1214 and is then transferred to the adder portion 1215.

The adder portion 1215 adds the decoded predicted-error information outputted from the video-decoding portion 1214
to the predicted video-frame outputted from the motion-compensated interframe-predicting portion 1217 to form a
new reference video-frame which is then transferred to the frame memory portion 1216.

The frame memory portion 1216 stores therein the new reference video-frame outputted from the adder portion
1215. This new reference video-frame is given to the motion-compensated interframe predicting portion 1217 when
encoding a succeeding input video-frame.

The video coding device outputs a series of coded video information (predicted error information) and a series of 
coded side-information by repeating the above-mentioned sequence of operations. 

The operation of the conventional video-decoding device is now described below. Fig. 6 is illustrative of an
exemplified entire system of the conventional video-decoding device.

It is assumed that a reference video-frame usable for producing a predictive video-frame is stored in a frame
memory portion 1216 in a normal operating condition of the conventional video-coding device for conducting the
motion-compensated interframe predictive coding. Coded video-information inputted into the video-decoding device
enters into a video-decoding portion 1221, which decodes the coded video-information by performing the same
operations as the video-decoding portion 1214 of the video-coding device and outputs the obtained differential
video-frame to an adder portion 1222.

Coded side-information inputted into the video-decoding device enters into a motion-compensated interframe
predicting portion 1223.

The motion-compensated interframe predicting portion 1223 decodes the received coded side-information and
obtains motion vectors. It then produces a predicted video-frame by using the obtained motion vectors and a reference
video-frame read from the frame memory portion 1224 and transfers the produced predicted video-frame to an adder
portion 1222.

The adder portion 1222 obtains an output video-frame by adding the differential video-frame from the video-decoding
portion 1221 to the predicted video-frame from the motion-compensated interframe predicting portion 1223.
The output video-frame is outputted from the video-decoding device and at the same time is transferred to the frame
memory 1224.

The frame memory 1224 stores therein the video-frame inputted from the adder portion 1222 as a new reference
video-frame that will be used by the motion-compensated interframe predicting portion 1223 when decoding a
succeeding video-frame.

The video decoding device realizes outputting a series of decoded video-frames by repeating the above-mentioned 
sequential operations. 
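The coding and decoding loops described above can be sketched in a few lines. This is a deliberately reduced illustration: the transform is omitted and only a uniform quantiser with a hypothetical step size stands in for the video coding portion, and the prediction is a zero-motion copy of the reference frame. What it shows is that the coder rebuilds its reference from the locally decoded data, so the coder and the decoder hold the same reference frame.

```python
import numpy as np

STEP = 8  # hypothetical quantiser step size

def encode_residual(residual):
    """Stand-in for the video coding portion: uniform quantisation only."""
    return np.round(residual / STEP).astype(np.int32)

def decode_residual(levels):
    """Stand-in for the video-decoding portion: inverse quantisation."""
    return levels.astype(np.float64) * STEP

def predict(reference):
    """Stand-in for motion-compensated prediction: zero motion (copy the reference)."""
    return reference.copy()

def encode_sequence(frames):
    reference = np.zeros_like(frames[0], dtype=np.float64)  # frame memory
    coded = []
    for frame in frames:
        predicted = predict(reference)
        levels = encode_residual(frame - predicted)      # subtracting and coding portions
        coded.append(levels)
        reference = predicted + decode_residual(levels)  # local decoding and adder
    return coded

def decode_sequence(coded, shape):
    reference = np.zeros(shape, dtype=np.float64)        # frame memory
    output = []
    for levels in coded:
        reference = predict(reference) + decode_residual(levels)
        output.append(reference.copy())
    return output
```

Because each frame is predicted from the reconstructed reference rather than from the original, the quantisation error does not accumulate from frame to frame.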

The operation of a motion-compensated interframe predicting portion in a video-coding device and a video-decoding
device and prior arts applied therein will be described below.

An exemplified structure of a motion-compensated interframe predicting portion and its operation will be first de- 
scribed as follows: 

Fig. 7 shows a conventional example of a motion-compensated interframe predicting portion of a video-coding 
device. 

As Fig. 7 shows, the motion-compensated interframe-predicting portion 1217 comprises a motion-vector searching
portion 1241, a fixed-area predicted-frame generating portion 1242 and a motion-vector coding portion 1243.

The motion-vector searching portion 1241 searches motion vectors in an input video-frame and a reference video-frame
read from a frame memory 1216 and outputs the detected motion-vectors to the fixed-area-size predicted-frame
generating portion 1242.

When searching motion vectors, the motion-vector searching portion 1241 searches a motion vector of a control
grid point by weighting a center pixel of a processable area beforehand and outputs a motion vector of each control
grid point to the fixed-area-size predicted-frame generating portion 1242.

According to the conventional method, a processable area has a fixed size, with one control grid point set for every
16 or 8 pixels. Fig. 14 illustrates an example of a control grid whose points are set one for every 16 pixels, indicating
motion vectors existing therein.

The fixed-area-size predicted-frame generating portion 1242 performs interframe prediction on each processable 
area having a fixed size (generally in a rectangular block of 16 x 16 pixels) by using motion vectors inputted from the 
motion vector searching portion 1241 and a preceding video-frame read from the frame memory portion 1216. 

The interframe prediction is realized by determining affine parameters from positions of three vertex points of each
objective triangular area, performing affine transformation on every pixel existing within the triangular area and
repeating the same processing on all processable triangular areas to produce a predicted video-frame.

The produced predicted video-frame is transferred to the before-described subtracting portion 1211 and adder
portion 1215. The motion vectors are transferred to the motion vector coding portion 1243.

The motion vector coding portion 1243 encodes the motion vectors received from the fixed-area predicted-frame
generating portion 1242 and outputs them as coded side-information.

The motion-vector coding portion 1243 encodes the motion vectors by the following method:

Motion vectors are generally encoded not directly but by a predictive method. Namely, a predicted value of each
motion vector to be coded is determined, and a difference between the predicted value and the motion vector
value is encoded. By doing so, the efficiency of coding can be improved with a saved amount of information.

The simplest method of determining a predicted value is to adopt the vector value just coded as the
predicted value of the succeeding motion vector. According to this method, differences between two successive
neighboring motion vectors are encoded in turn. Because of the high correlation between motion vectors, it is possible
to encode motion vectors effectively, particularly when neighboring motion vectors have similar values. This prediction
coding method is adopted in the motion-vector coding method defined by ITU-T Recommendation H.261.

Another method is to determine a predicted value from a plurality of motion vectors including the just-preceding
vector. In this case, a prediction value is determined by using values of three motion vectors which are located,
respectively, left, just above and above right of the motion vector to be coded. The prediction value may be a mean
value or a median of the three values of the three neighboring motion vectors. In comparison with the predicting method
using only the just-preceding motion vector, this method can use a wider range of correlation, i.e., a higher correlation
of motion vectors, attaining a further improved coding efficiency. The video coding system defined by ITU-T
Recommendation H.263 adopts the prediction method that codes a motion vector by using a median of the three
motion-vector values as the predicted value, which is well known to be more effective.
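The median-based prediction can be sketched as follows. The function names are illustrative, the median is taken separately for each component, and the handling of vectors at frame borders (where some neighbors do not exist) is left out; the sketch is not a reproduction of the H.263 syntax.

```python
def median_predictor(left, above, above_right):
    """Component-wise median of three neighbouring motion vectors, each an (x, y) tuple."""
    xs = sorted(v[0] for v in (left, above, above_right))
    ys = sorted(v[1] for v in (left, above, above_right))
    return (xs[1], ys[1])

def encode_vector(mv, left, above, above_right):
    """Difference between the motion vector and its predicted value (the value that is coded)."""
    px, py = median_predictor(left, above, above_right)
    return (mv[0] - px, mv[1] - py)

def decode_vector(diff, left, above, above_right):
    """Inverse of encode_vector: add the predicted value back to the decoded difference."""
    px, py = median_predictor(left, above, above_right)
    return (diff[0] + px, diff[1] + py)
```

Replacing the predictor with the previously coded vector alone gives the simpler scheme described first.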

The structure and operation of a motion-compensated interframe-predicting portion 1223 of a conventional
video-decoding device will now be described as follows:

Fig. 8 is illustrative of an exemplified structure of the motion-compensated interframe-predicting portion of the
video-decoding device.

As shown in Fig. 8, the motion-compensated interframe predicting portion 1223 is composed of a fixed-area-size
predicted-frame generating portion 1251 and a motion-vector decoding portion 1252.

Coded side-information inputted into the motion-compensated interframe predicting portion 1223 is transferred to
the motion-vector decoding portion 1252, which in turn decodes the received coded side-information, obtains motion
vectors and outputs the motion vectors to the fixed-area-size predicted-frame generating portion 1251.

The fixed-area-size predicted-frame generating portion 1251 performs interframe prediction processing by using 
the motion vectors received from the motion vector decoding portion 1252 and a reference video-frame received from 
the frame memory portion 1224. 

Interframe prediction is to determine affine parameters from positions of three vertices of an objective triangular 
area and motion vectors and then to perform affine transformation of all pixels within the triangular area by using the 
determined affine parameters. This processing is repeated on all objective triangular areas to generate a predicted 
video-frame. 

The fixed-area-size predicted-frame generating portion 1251 outputs the obtained predicted video-frame to an 
adder portion 1222. 

Some kinds of methods used for motion-compensated interframe prediction in the above-mentioned video-coding 
and video-decoding devices will be described below. 
A motion vector searching method is as follows: 

A so-called "pixel-matching" method that generally weights a center pixel in a processable area is used for
determining a motion vector of a control grid point. Fig. 9 is an illustration of a weighted area of pixels. In Fig. 9, the
processable area is composed of, for instance, 21 pixels in each of the X-direction and the Y-direction. In pixel matching,
an area of a reference frame which matches the processable area of a current frame is determined by calculation,
and motion vectors are then determined as displacement values in the X-direction and the Y-direction. In the matching
calculation, a difference between the processable area of the current video-frame and the matched area of the
reference frame is multiplied by a weighting coefficient. By doing so, motion vectors are searched with a weight put
on a center pixel in the processable area.

Motion vectors at control grid points located at the periphery of a video-frame are set as follows: 

Motion vectors at quadrangular control grid points: 

X- and Y-components of each vector are set both at 0. 
Motion vectors at the top and bottom control grid points (triangles shown in Fig. 10): 

An X-component is determined as an X-component of a motion vector detected at a control grid point one
inside from an objective control grid point. A Y-component is set at 0.
Motion vectors at the left-side and right-side control-grid points (circles shown in Fig. 10): 

An X-component of each motion vector is set at 0. A Y-component is determined as a Y-component of a motion 
vector detected at a control grid point one inside from an objective control grid point. 
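The three border rules can be sketched on a grid of vectors as below. Two points are assumptions rather than statements of the specification: the "quadrangular" points are taken to be the four corner points of the control grid, and the grid is stored as an array of (x, y) vectors with the interior already searched.

```python
import numpy as np

def set_border_vectors(grid):
    """grid: array of shape (rows, cols, 2) holding (x, y) motion vectors.
    Returns a copy whose peripheral vectors follow the rules in the text."""
    g = np.asarray(grid, dtype=np.float64).copy()
    # Top and bottom points: X from the grid point one inside, Y = 0.
    g[0, :, 0] = g[1, :, 0]
    g[0, :, 1] = 0.0
    g[-1, :, 0] = g[-2, :, 0]
    g[-1, :, 1] = 0.0
    # Left-side and right-side points: X = 0, Y from the grid point one inside.
    g[:, 0, 1] = g[:, 1, 1]
    g[:, 0, 0] = 0.0
    g[:, -1, 1] = g[:, -2, 1]
    g[:, -1, 0] = 0.0
    # Corner points (assumed to be the "quadrangular" points): both components 0.
    for r in (0, -1):
        for c in (0, -1):
            g[r, c] = 0.0
    return g
```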
The affine transformation is described as follows: 

The affine transformation is conducted by representing a map from a video-frame to another video-frame by using 
six parameters. 

The affine transformation is conducted, for the sake of simplifying calculation of affine parameters, usually on a 
triangular area. 

Fig. 11 is a view for explaining a method for interframe prediction by using affine transformation in the case of
forward predicting.

Motion vectors of control grid points A, B, C and D of the current video-frame are detected at corresponding control
grid points A', B', C' and D' of a reference video-frame.

Three of the four control grid points are first selected and the area is divided to determine affine parameters. For
example, the area on the current video-frame is divided into two triangles ABC and BCD, and the corresponding area
on the reference video-frame is divided into two triangles A'B'C' and B'C'D'.

For each of the triangles into which the area is divided, affine parameters are determined from vertex positions of
the triangle (vertex positions of one triangle and motion vectors of the other may be used).

A predictive video-frame is generated by mapping thereto all pixels of all triangular sub-areas according to the 
obtained affine parameters. 

If a position of any referred pixel in the reference video-frame is not an integer, a predicted value is bilinearly
interpolated to determine a pixel value of the predicted video-frame.

A predicted video-frame is generated by performing the above-mentioned processing operations. 
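Determining the six affine parameters from the three vertices of a triangle amounts to solving a small linear system. The sketch below assumes that the vertex positions in one frame and the corresponding positions in the other frame are given as (x, y) pairs; the function names and the 2x3 matrix layout are illustrative choices.

```python
import numpy as np

def affine_from_triangle(src, dst):
    """Solve for the six parameters mapping three points `src` onto `dst`.
    Returns a 2x3 matrix M such that [x', y'] = M @ [x, y, 1].
    The three source points must not lie on one line."""
    src = np.asarray(src, dtype=np.float64)
    dst = np.asarray(dst, dtype=np.float64)
    A = np.hstack([src, np.ones((3, 1))])  # one row [x, y, 1] per vertex
    # A @ M.T = dst, hence M.T = solve(A, dst).
    return np.linalg.solve(A, dst).T

def map_point(M, x, y):
    """Position in the other frame corresponding to (x, y)."""
    return M @ np.array([x, y, 1.0])
```

For example, with vertices (0, 0), (16, 0), (0, 16) and motion vectors (1, 0), (2, 1), (0, -1), the destination points are the vertices plus their vectors; every pixel inside the triangle is then mapped with `map_point`, and a non-integer result is resolved by bilinear interpolation as stated above.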

The operation of a motion-compensated interframe predicting portion in a video-coding device and a video-decoding
device and methods applied therein will be described below.

An exemplified structure of a motion-compensated interframe predicting portion 1217 (shown in Fig. 5) and its
operation will be first described as follows:

As shown in Fig. 12, the motion-compensated interframe-predicting portion 1217 comprises a motion-vector
searching portion 1341, a predicted-frame generating portion 1342 and a motion-vector coding portion 1343.

The motion-vector searching portion 1341 searches motion vectors in an input video-frame and a reference video-frame
read from a frame memory portion 1216 and outputs the detected motion-vectors to the predicted-frame
generating portion 1342.

The predicted-frame generating portion 1342 generates a predictive image by using any one of several prediction
methods, i.e., the block-displacement method, the affine transformation method, the bilinear transformation method, etc.

In the process of generating a predicted video-frame, side-information such as area-information and motion vectors
can be obtained.

The generated predicted video-frame is transferred to the subtracting portion 1211 and the adder portion 1215,
while the side-information is transferred to the side-information coding portion 1343.

The side-information coding portion 1343 encodes the side-information received from the predicted-frame
generating portion 1342 and outputs the coded side-information.

The structure and the operation of the motion-compensated interframe-predicting portion 1223 (shown in Fig. 8)
of a conventional video-decoding device will now be described as follows:

As shown in Fig. 13, the motion-compensated interframe predicting portion 1223 is composed of a predicted-frame
generating portion 1351 and a side-information decoding portion 1352.

Coded side-information inputted into the motion-compensated interframe predicting portion 1223 is transferred to
the side-information decoding portion 1352.

The side-information decoding portion 1352 decodes the received coded side-information, obtains decoded
side-information including area-information and motion vectors, etc., and outputs the decoded side-information to the
predicted-frame generating portion 1351.

The predicted-frame generating portion 1351 performs interframe-prediction processing by using the
side-information received from the side-information decoding portion 1352 and the reference video-frame received
from the frame memory portion 1324 and outputs the produced predicted video-frame to the adder portion 1222.

An embodiment of the present invention will be explained below referring to the accompanying drawings. Fig. 14
shows an exemplified structure of an interframe predicting portion of a video coding device according to the present
invention.

As shown in Fig. 14, the video coding device comprises a frame memory portion 111 for storing already encoded
video signals, a motion vector detecting portion 112 for determining a representative motion vector per unit area from
a difference between an input video signal and a video signal read from the frame memory portion 111, a weighting
coefficient control portion 113 for determining a weighting coefficient for each representative motion vector and
instructing the weighting of each representative motion vector, a motion vector interpolating portion 114 for determining
a motion vector per pixel from a representative motion vector, and a pixel value predicting portion 115 for preparing a
predicted video signal from a video signal read from the frame memory portion 111 by using each motion vector per pixel.

The frame memory portion 111, the motion vector detecting portion 112 and the pixel value predicting portion 115
are similar in function to those of the conventional video coding device. The video coding device according to the
present invention differs from the conventional device by provision of the weighting coefficient control portion 113 for
specifying a weighting coefficient for each representative motion vector and a motion vector interpolating portion 114
which determines a motion vector per pixel from a representative motion vector outputted by the motion vector detecting
portion 112 and by using a weighting coefficient for each vector, said coefficient being outputted from the weighting
coefficient control portion 113.

The operations of the weighting coefficient control portion 113 and the motion vector interpolating portion 114, both
of which are essential portions of the embodiment, are as follows:

In this embodiment, the motion vector interpolating portion 114 uses the bilinear transformation method for
determining a motion vector of a pixel, but it may also use the affine transformation method. As in the above-mentioned
conventional coding device, the motion vector interpolating portion 114 performs its operation on each unit-area. Namely,
a unit-area for bilinear transformation is an area enclosed by four neighboring representative points (three
representative points in the case of the affine transformation) corresponding to representative motion vectors outputted
from the motion vector detecting portion 112. The weighting coefficient control portion 113 decides weighting
coefficients for the respective motion vectors of the respective vertices (representative points) of a unit-area to be
transformed. Here, pixels near a representative point may be considered to be most strongly affected by the vector of
that nearest representative point. Accordingly, a unit-area to be transformed is further divided into subareas
which are given different weighting coefficients. Fig. 3(c) illustrates a unit-area divided into four subareas.
The weighting coefficient control portion 113 judges whether the motion vectors of the vertices of a unit-area for
transformation represent a movement of the same object of an image, i.e., whether or not said unit-area exists on one
object in the image. When it has judged that the motion vectors concern the same object, the weighting coefficient
control portion 113 instructs the motion vector interpolating portion 114 to evenly weight all representative motion
vectors of the vertices (e.g., all with the same coefficient of 1). When it has judged that the vertices of the unit-area
exist on different objects, the weighting coefficient control portion 113 decides weighting coefficients in such a way that
a motion vector of a vertex existing on the same object as a remarked subarea represented by a vertex is weighted with
a larger weighting coefficient and a motion vector of a vertex judged to exist on a different object is weighted with a
smaller weighting coefficient. In the extreme case, the weighting coefficient control portion 113 instructs the motion
vector interpolating portion 114 to weight the motion vector of the different object with a weighting coefficient of 0 so
that said vector is not used. The determination of weighting coefficients is conducted subarea by subarea. Weighting is
controlled in such a way that a motion vector which was given a large weighting coefficient in one subarea is given a
small weighting coefficient in the other subareas. This is illustrated in Fig. 3(d). Weighting coefficients outputted by the
weighting coefficient control portion 113 may be a series of real numbers, or they may be determined by selecting the
closest approximation to an optimal value from discretely defined values, e.g., 0, 0.25, 0.5, 0.75 and 1.

The motion vector interpolating portion 114 weights every representative motion vector with a weighting coefficient
specified by the weighting coefficient control portion 113, then calculates motion vectors of respective pixels in the
subarea and outputs the calculation results. When the motion vector of a vertex of a unit-area for transformation is
$MV_i$ ($i = 1, 2, 3, 4$) and the weight for the motion vector is $w_{ij}$ ($i, j = 1, 2, 3, 4$), the weighting may, by way
of example, be performed as follows:

$WMV_j = \sum_{i} w_{ij} \cdot MV_i \quad (i, j = 1, 2, 3, 4)$ ... (1)

The usual bilinear transformation equation may be calculated by using the weighted motion vector $WMV_j$. When
a motion vector $MVP_{xy}$ of a pixel having coordinates (x, y) is determined from the base motion vectors $MV_i$ by
bilinear transformation and a weight $w_i$ ($i = 1, 2, 3, 4$) is given to each motion vector, the following equation holds:

$WMVP_{xy} = w \cdot MVP_{xy} + (1 - w) \cdot \sum_{i} w_i \cdot MV_i \quad (i = 1, 2, 3, 4)$ ... (2)

The coefficient w indicates a weighting ratio as a whole. The pixel motion vector $MVP_{xy}$ is corrected to
$WMVP_{xy}$ in view of the weighting.

The motion vector interpolating portion 114 performs the above-mentioned operations for each subarea by using
the weighting coefficient specified for each subarea by the weighting coefficient control portion 113.
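Equations (1) and (2) can be sketched numerically as below. The sketch rests on one reading of the printed equations, namely that equation (1) sums over the vertex index i for each j and that equation (2) blends the bilinearly interpolated pixel vector with a weighted sum of the vertex vectors; the function names and array layouts are illustrative.

```python
import numpy as np

def weighted_vertex_vectors(weights, mvs):
    """Assumed form of equation (1): WMV_j = sum_i w_ij * MV_i.
    weights: (4, 4) array indexed [i, j]; mvs: (4, 2) array of vertex motion vectors."""
    return np.asarray(weights, dtype=np.float64).T @ np.asarray(mvs, dtype=np.float64)

def weighted_pixel_vector(mvp_xy, w, vertex_weights, mvs):
    """Assumed form of equation (2): WMVP_xy = w * MVP_xy + (1 - w) * sum_i w_i * MV_i."""
    mvs = np.asarray(mvs, dtype=np.float64)
    correction = np.asarray(vertex_weights, dtype=np.float64) @ mvs
    return w * np.asarray(mvp_xy, dtype=np.float64) + (1.0 - w) * correction
```

With w = 1 the pixel vector is left as interpolated; with a weight of 0 for a vertex, that vertex's vector drops out of the correction term, which corresponds to the extreme case of not using the vector of a different object.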

The weighting coefficient control portion 113 may have M sets of weighting coefficients previously prepared, for
example:

$W = \{W_m\} \quad (m = 1, 2, \ldots, M)$

$W_m = \{w_{m,ij}\} \quad (i, j = 1, 2, 3, 4)$

It may select, as an optimal weight pattern for each unit-area or subarea, the prepared set of coefficients which is
nearest to the weighting coefficients determined for the unit-area or subarea by the above-mentioned weighting
coefficient processing. In this case, the weighting coefficient control portion 113 gives information about the selected
pattern to the motion vector interpolating portion 114. It is also possible to previously prepare weight patterns of paired
weighting coefficients, one of which is for one subarea and the other for another subarea. In this case, it becomes
possible to specify weighting coefficients for all subareas of a unit-area at a time by specifying only a weight pattern
for the unit-area.

In this instance, the motion vector interpolating portion 114 holds the same weight patterns as those held in the
weighting coefficient control portion 113, and it determines a motion vector for each pixel in a similar way to that described
before by using the weighting coefficients for each motion vector according to the specified weight pattern. It
outputs the calculation results.
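The selection of a prepared weight pattern can be sketched as follows. The description only says that the "nearest" prepared set is chosen; the squared Euclidean distance used here, and the name `nearest_pattern`, are assumptions for illustration.

```python
def pattern_distance(a, b):
    """Squared Euclidean distance between two 4x4 weight sets (assumed metric)."""
    return sum((x - y) ** 2
               for row_a, row_b in zip(a, b)
               for x, y in zip(row_a, row_b))


def nearest_pattern(target, patterns):
    """Return the index m of the prepared pattern W_m closest to the target weights."""
    return min(range(len(patterns)),
               key=lambda m: pattern_distance(patterns[m], target))
```

Only the index m would then need to be passed to the motion vector interpolating portion, since both portions hold the same table of patterns.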

Referring to the flow chart shown in Fig. 15, a first example of the weighting coefficient control portion 113 of the video
coding device according to the present invention will be described as follows:

The motion vector detecting portion 112 enters representative motion vectors into the weighting coefficient control
portion 113 (Step 1). The weighting coefficient control portion 113 forms a transformable unit area of four neighboring
representative points (Step 2). Then, it examines the signs of the horizontal and vertical components of the representative
motion vector of each vertex and classifies the directions of the motion vectors by extracting only positive, negative or zero
signs (Step 3). The examination results are compared with each other to determine whether the motion vectors of the four
vertices have roughly matched directions (Steps 4 and 5). If the motion vectors are recognized to be of a small dispersion and
sufficiently matched, they are considered to represent the same object of an image and the same weighting coefficients
are outputted (Steps 6 and 9). If a large dispersion is found between the motion vectors, the transformable unit-area
is judged to include a plurality of different objects in an image. A vector showing a particular direction is separated and
put into another matched group (Step 7). Weighting coefficients are decided for each subarea of the transformable unit
area in such a way that the nearest neighboring motion vector of a remarked subarea and any motion vector showing a
matched direction are weighted heavily and a motion vector judged to belong to another group or to show a particular
direction is weighted lightly (Steps 8 and 9).
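The sign-based classification of Steps 3 to 9 might be sketched as below. Exact equality of sign pairs stands in for "roughly matched directions", and the heavy/light values and the normalization of each row to sum to 1 are assumptions, not values given in the description.

```python
def sign(v):
    """Return +1, 0 or -1 according to the sign of v."""
    return (v > 0) - (v < 0)


def direction_class(mv):
    """Step 3: keep only the signs of the horizontal and vertical components."""
    return (sign(mv[0]), sign(mv[1]))


def subarea_weights(mvs, heavy=1.0, light=0.0):
    """Return rows[j][i]: weight of vertex i's vector in the subarea nearest vertex j."""
    n = len(mvs)
    classes = [direction_class(mv) for mv in mvs]
    if len(set(classes)) == 1:
        # Steps 6 and 9: all directions match, so the same weight is used everywhere.
        return [[1.0 / n] * n for _ in range(n)]
    rows = []
    for j in range(n):
        # Steps 7 to 9: vectors in the same direction group as vertex j are weighted heavily.
        raw = [heavy if classes[i] == classes[j] else light for i in range(n)]
        total = sum(raw)
        rows.append([r / total for r in raw])
    return rows
```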

Referring to the flow chart shown in Fig. 16, a second example of the weighting coefficient control portion 113 of the video
coding device according to the present invention will be described as follows:

As in the first example, the motion vector detecting portion 112 enters representative motion vectors into the weighting
coefficient control portion 113 (Step 1), whereby a transformable unit area is formed by four neighboring representative
points (Step 2). Then, the horizontal and vertical component values of the four neighboring representative vectors of the
vertices are compared with each other (Steps 3 and 4) and the motion vectors of the four vertices are examined as to whether
they are matched in their directions and magnitudes (Step 5). When the examination result shows that the motion vectors
are satisfactorily matched to each other without a large dispersion, the motion vectors are considered to represent the
same object in an image and a uniform weighting coefficient is outputted for them (Steps 6 and 9). If a large dispersion is
found between the motion vectors, the unit-area to be transformed is judged to include a plurality of different
objects in an image. A motion vector showing a particular direction and magnitude is separated and put into another
matched group (Step 7). Weighting coefficients for each subarea of the transformable unit area are decided in such a
way that the nearest neighboring motion vector of a remarked subarea and any motion vector matched thereto are weighted
heavily and a motion vector judged to belong to another group or to show a particular direction is weighted lightly (Steps
8 and 9).

The second example differs from the first example in that it uses not only the vector signs but also the vector values themselves
and is therefore capable of more accurately determining the direction of each vector and of discriminating two motion vectors
directed in the same direction but having different magnitudes. Namely, stricter processing of the vectors can be
accomplished.
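The difference from the sign-only test can be seen in a small sketch that groups vectors by their component values. The tolerance `tol` and the greedy grouping rule are hypothetical; the description does not state how "matched in direction and magnitude" is measured.

```python
def matches(a, b, tol=2.0):
    """True when both components of a and b differ by at most tol (assumed criterion)."""
    return abs(a[0] - b[0]) <= tol and abs(a[1] - b[1]) <= tol


def split_groups(mvs, tol=2.0):
    """Greedy grouping: each vector joins the first group whose first member it matches."""
    groups = []
    for idx, mv in enumerate(mvs):
        for group in groups:
            if matches(mvs[group[0]], mv, tol):
                group.append(idx)
                break
        else:
            groups.append([idx])
    return groups
```

For the vectors (4, 0) and (12, 0), a sign-only comparison reports the same direction, whereas this value-based comparison places them in different groups.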

Fig. 17 shows a first example of a structure of an interframe predicting portion composing a video decoding device
according to the present invention. The video decoding device of Fig. 17 comprises a frame memory portion 121 for
storing already decoded video signals, a weighting coefficient control portion 122 for determining and designating
weighting coefficients for representative motion vectors inputted for every transformable unit-area, a motion vector
interpolating portion 123 for calculating a motion vector for each pixel from the representative motion vectors and the
weighting coefficients, and a pixel value predicting portion 124 for preparing a predicted video signal from a video signal
read from the frame memory portion 121 by using the motion vector per pixel.

The frame memory portion 121 and the pixel value predicting portion 124 are similar in function to those of the
conventional video decoding device. The video decoding device according to the present invention differs from the
conventional video decoding device in that the weighting coefficient control portion 122 is provided for specifying a
weighting coefficient for each representative motion vector and the motion vector interpolating portion 123 is used for
determining and outputting motion vectors pixel by pixel by using the representative motion vectors and the weighting
coefficient for each vector outputted by the weighting coefficient control portion 122.

The operations of the weighting coefficient control portion 122 and the motion vector interpolating portion 123,
both of which are essential portions of the embodiment, will be described below. Both portions function similarly
to those of the coding device according to the present invention. In other words, the decoding device performs
the same processing operations as the coding device does.

The weighting coefficient control portion 122 decides weighting coefficients for the respective motion vectors of the
respective vertices (representative points) of a unit-area to be processed by the motion vector interpolating portion 123
and specifies the coefficients for each subdivision of the transformable unit area. The weighting coefficient control
portion 122 also judges whether the motion vectors of the vertices of a unit-area for transformation represent a movement of
the same object in an image, i.e., whether or not said unit-area exists on one object in the image. When it has judged that the
motion vectors concern the same object, the weighting coefficient control portion 122 instructs the motion vector
interpolating portion 123 to evenly weight all representative motion vectors of the vertices. When it has judged that any
vertex of the unit-area exists on a different object, the weighting coefficient control portion 122 decides weighting coefficients in
such a way that a motion vector of a vertex existing on the same object as a remarked subarea represented by a vertex
is weighted with a larger weighting coefficient and a motion vector of a vertex judged to exist on a different object is
weighted with a smaller weighting coefficient. The processing operations and outputs of the weighting coefficient control
portion 122 are similar to those of the weighting coefficient control portion 113 of the video coding device.

The motion vector interpolating portion 123 receives the designated weighting coefficients from the weighting
coefficient control portion 122 and calculates motion vectors for every pixel in the subarea by using the received
coefficients in the same way as described for weighting motion vectors by the motion vector interpolating portion 114 of the
video coding device.

The weighting coefficient control portion 122 may have M sets of weighting coefficients previously prepared and
may select, as an optimal weight pattern for each unit-area or subarea, the set that is nearest to the
weighting coefficients determined for the unit-area or subarea by the above-mentioned weighting coefficient processing.
This feature is the same as described for the video coding device. In this case, the weighting coefficient control portion
122 gives information about the selected pattern to the motion vector interpolating portion 123.

The motion vector interpolating portion 123 has the same weight patterns as held in the weighting coefficient control
portion 122, determines motion vectors for pixels by using the corresponding weighting coefficients according to the
specified weight pattern in the above-mentioned way and outputs the weighted motion vectors.

Fig. 18 shows a second exemplified structure of an interframe predicting portion of a video decoding device
according to the present invention.

The video decoding device of Fig. 18 comprises a frame memory portion 181 for storing already decoded video signals,
a motion vector converting portion 182 for calculating representative motion vectors for a dropped frame from
representative motion vectors for a coded frame, a weighting coefficient control portion 183 for determining and indicating
weighting coefficients of representative motion vectors inputted for a unit area, a motion vector interpolating portion
184 for calculating a motion vector for each pixel from the corresponding representative motion vector by using the
weighting coefficients, and a pixel value predicting portion 185 for preparing a predicted video signal from a video signal
read from the frame memory portion 181 by using the motion vector per pixel.

The frame memory portion 181, the weighting coefficient control portion 183, the motion vector interpolating portion
184 and the pixel value predicting portion 185 are similar in function to those of the first embodiment of the video
decoding device. This second embodiment of the video decoding device differs from the first embodiment in that motion
vectors of encoded video frames are supplied to the motion vector converting portion 182, which outputs motion vectors
of video frames omitted without having been encoded.

The operation of the motion vector converting portion 182 is as follows: 

The motion vector converting portion 182 internally divides a representative motion vector for an input coded video
frame according to the temporal distance between two neighboring coded video frames and the temporal position of an omitted
video frame. For example, suppose that a first frame of a video sequence was encoded first and a fifth frame was encoded next.
Namely, three frames (second, third and fourth frames) were skipped. Now, representative motion vectors for the fifth
video frame are inputted. In this case, the temporal distance between the coded frames (i.e., the first frame and the
fifth frame) is 4. As the motion vectors for the fifth frame represent a movement over a 4-frame time, the vector value
is proportionally distributed to the dropped frames: motion vectors for the second frame, the third frame and the fourth
frame are determined by multiplying the fifth-frame motion-vector value by 1/4, 1/2 and 3/4 respectively. The determined
values are outputted as the motion vectors for the respective dropped frames.
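The proportional distribution described above can be written as a short sketch. The function name and the use of exact fractions are illustrative choices; the description does not say how fractional vector values are represented or rounded.

```python
from fractions import Fraction


def dropped_frame_vectors(mv, prev_frame, coded_frame):
    """Scale the coded frame's vector for every frame skipped between two coded frames.

    mv is the representative motion vector (x, y) of the newly coded frame,
    measured relative to the previously coded frame.
    """
    distance = coded_frame - prev_frame
    return {
        frame: (mv[0] * Fraction(frame - prev_frame, distance),
                mv[1] * Fraction(frame - prev_frame, distance))
        for frame in range(prev_frame + 1, coded_frame)
    }
```

For the example in the text (frames 1 and 5 coded), a fifth-frame vector of (8, -4) gives (2, -1), (4, -2) and (6, -3) for the second, third and fourth frames.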

The outputted motion vectors are transferred to the weighting coefficient control portion 183 and the motion vector
interpolating portion 184, wherein the vectors are processed in the same way as in the first embodiment. Finally,
predicted video-signals are prepared by the pixel value predicting portion 185 and outputted as interpolating video-signals
of the dropped video-frames.

As is apparent from the foregoing description, the video coding device and the video decoding device according
to the present invention offer the following advantages:

A video coding device is featured in that weighting coefficient control means determines a weighting coefficient for
each representative vector obtained by vector detecting means and gives an instruction to motion vector interpolating
means, which can therefore accurately determine a motion vector of each pixel with a minimized effect of unrelated
motion vectors even in the case of mismatches existing between a transformable unit-area and a content or position
and movement of an object, realizing an improvement of the efficiency of interframe prediction coding. Accordingly,
the application of this video-coding device can realize a highly efficient video-coding system that is desired for use
in very low bit-rate video-communication.

The video coding device is featured in that weighting coefficient control means selects a suitable one of previously
prepared and stored patterns of weighting coefficients for each representative motion vector and gives an instruction
to motion vector interpolating means, which can therefore accurately determine a motion vector of each pixel with a
minimized effect of unrelated motion vectors in each transformable unit area, achieving a considerable improvement
of the efficiency of interframe prediction coding by applying the above-mentioned simplified method. Owing to the
increased efficiency of video coding with a reduced data amount, this video-coding device can realize savings in
hardware and software and be manufactured with a reduced size at a low cost.

A video-coding device is featured in that its weighting coefficient control means determines a weighting coefficient
according to only a direction of each representative motion vector, considerably simplifying processing operations of
the weighting coefficient control means. Accordingly, the video-coding device, which has a high efficiency video-interframe
prediction method giving an increased efficiency of video coding, has a small size and is inexpensive to manufacture.

A video-coding device is featured in that weighting coefficient control means determines a vector weighting
coefficient according to a vector value of each representative motion vector to enable motion vector interpolating means
to strictly discriminate motion vectors irrelevant to a pixel in a transformable unit area, achieving an improved quality
of motion vectors of respective pixels. Accordingly, the video-coding device has a high efficiency video-interframe
predicting system giving an increased efficiency of video coding sufficient to realize the encoding of video-signals at a
further decreased bit-rate.

A video-decoding device is featured in that weighting coefficient control means determines a weighting coefficient
for each representative vector obtained by vector detecting means and gives an instruction to motion vector interpolating
means, which can therefore accurately determine a motion vector of each pixel with a minimized effect of unrelated
motion vectors even in the case of mismatches existing between a transformable unit-area and a content or
position and movement of an object, achieving a considerable improvement of the efficiency of interframe prediction
and an increased quality of decoded video-signals. Accordingly, this video-decoding device can obtain high-quality
decoded video-signals suitable for use in very low bit-rate video-communications.

A video-decoding device is featured in that weighting coefficient control means selects a suitable one of previously
prepared and stored patterns of weighting coefficients for each representative motion vector and gives an instruction
to motion vector interpolating means, which can therefore accurately determine a motion vector of each pixel with a
minimized effect of unrelated motion vectors in each transformable unit area, achieving a considerable improvement
of the efficiency of interframe prediction and an increased quality of decoded video-signals by using the above-mentioned
simplified method. Accordingly, this video-decoding device can realize savings in hardware and software and be
manufactured with a reduced size at a low cost.

A video-decoding device is featured in that its weighting coefficient control means determines a weighting coefficient
according to only a direction of each representative motion vector, considerably simplifying processing operations of
the weighting coefficient control means. Accordingly, the video-coding device, which has a high efficiency video-interframe
prediction method giving an increased efficiency of video coding, has a small size and is inexpensive to manufacture.

A video-decoding device is featured in that weighting coefficient control means determines a vector weighting
coefficient according to a vector value of each representative motion vector to enable motion vector interpolating means
to strictly discriminate motion vectors irrelevant to a pixel in a transformable unit area, achieving an improved quality
of motion vectors of respective pixels. Accordingly, the video-coding device has a high efficiency video-interframe
prediction method giving an increased efficiency of video coding sufficient to realize the encoding of video-signals
at a further decreased bit-rate.

A video-decoding device is featured in that motion vector converting means determines a motion vector for a
dropped frame from a representative motion vector inputted for a transformable unit area of an encoded frame and
prepares an interpolating video-signal corresponding to a frame skipped without having been encoded. This enables
obtaining interpolating video-signals according to motion vectors of pixels accurately determined with a minimized
effect of motion vectors irrelevant to those in a transformable unit-area. Namely, the quality and temporal resolution of
restorable images can be remarkably improved. Accordingly, the decoding device can obtain decoded video signals
giving a smooth motion picture of high image quality even in very low bit-rate video-communications.

Referring to the accompanying drawings, preferred embodiments of the present invention will be described as follows:

The general operation of the video-coding and video-decoding devices according to the present invention is the same
as described before for the conventional video-coding and video-decoding devices. Fig. 5 illustrates an exemplified
structure of the conventional video-coding device and Fig. 6 illustrates an exemplified structure of the conventional
decoding device.

Accordingly, the operation of a motion-compensated interframe predicting portion of a video-coding device according
to the present invention and the operation of a motion-compensated interframe predicting portion of a video-decoding
device according to the present invention will be explained below.

A motion-compensated interframe-predicting portion for use in a video-coding device, which embodies the present
invention, is described first as follows:

Referring to Fig. 19, a motion-compensated interframe-predicting portion 217 for use in a video-coding device is
composed of an effective area selecting portion 231, a motion-vector searching portion 232, a variable-area
predicted-frame generating portion 233, an area-dividing pattern deciding portion 234 and a side-information coding portion 235.

The effective area selecting portion 231 selects an effective-area selecting mask according to the position of a control
grid on an input video-frame and outputs the selected mask to the motion vector searching portion 232.

The motion-vector searching portion 232 searches motion vectors at control grid points on the input video-frame
and a reference video-frame inputted from the frame memory portion 216 by using the effective-area mask specified by the
effective area selecting portion 231 and outputs the found motion vectors to the variable-area predicted-frame
generating portion 233.

The variable-area predicted-frame generating portion 233 executes interframe prediction by using the motion
vectors inputted from the motion-vector searching portion 232 and the reference video-frame inputted from the frame
memory portion 216.

The interframe prediction is to determine affine parameters from the positions of three vertices of an objective triangular
area and their motion vectors and then to perform affine transformation on all pixels within the triangular area by using the
determined affine parameters. This processing is repeated on all objective triangular areas to generate a predicted
video-frame.
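Determining affine parameters from three vertices and their motion vectors amounts to solving two 3x3 linear systems. The following is a minimal sketch under the assumption that each vertex (x, y) with motion vector (mvx, mvy) maps to (x + mvx, y + mvy); the parameter names a to f and the mapping direction are illustrative and are not taken from the description.

```python
def det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def solve3(m, rhs):
    """Solve m * p = rhs for a 3x3 system by Cramer's rule."""
    d = det3(m)
    if d == 0:
        raise ValueError("degenerate triangle: vertices are collinear")
    solution = []
    for col in range(3):
        mc = [list(row) for row in m]
        for r in range(3):
            mc[r][col] = rhs[r]
        solution.append(det3(mc) / d)
    return solution


def affine_params(vertices, mvs):
    """Return (a, b, c, d, e, f) with x' = a*x + b*y + c and y' = d*x + e*y + f."""
    m = [[x, y, 1] for x, y in vertices]
    a, b, c = solve3(m, [x + mv[0] for (x, _), mv in zip(vertices, mvs)])
    d, e, f = solve3(m, [y + mv[1] for (_, y), mv in zip(vertices, mvs)])
    return a, b, c, d, e, f


def map_pixel(params, x, y):
    """Apply the affine transformation to one pixel position inside the triangle."""
    a, b, c, d, e, f = params
    return (a * x + b * y + c, d * x + e * y + f)
```

When all three vertices carry the same motion vector, the parameters reduce to a pure translation, which is a convenient sanity check.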

As shown in Fig. 20, there are 26 patterns of dividing an area into triangles. A rectangular area of 16 by 16 pixels
can be divided into 2 triangles (divisions) in 2 patterns, 5 triangles in 8 patterns and 8 triangles in 16 patterns.

These area-dividing patterns are given numerals 1 to 26, which are referred to as area-dividing pattern numbers.
The variable-area predicted-frame generating portion 233 can output a predicted video-frame generated by the
above-mentioned affine transformation as well as a predicted video-frame produced by translational displacement of
rectangular areas. The prediction by translational displacement uses only one representative motion vector for each objective
rectangular area. The representative motion vector is considered to be located at the center of the rectangular area (it need
not be located at the vertex points used for the motion vectors of the affine transformation). A predicted video-frame is
generated in such a way that the location of a processable rectangular area within a video-frame is displaced by the
representative motion-vector value and mapped onto the reference video-frame, and the rectangular area thus determined on the
reference video-frame is presented as a predicted image for the processable rectangular area.

An average of four motion-vectors based on four control-grid-points corresponding to four corner points of a proc- 
essable rectangular area can be also used as a representative motion-vector in place of the above-mentioned center 
motion-vector. 
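A minimal sketch of the translational prediction, including the alternative of averaging the four corner vectors, might look as follows. Integer vectors and clamping at the frame border are assumptions made to keep the example short; the description does not specify either.

```python
def average_vector(mvs):
    """Average of the control-grid-point vectors, usable as the representative vector."""
    n = len(mvs)
    return (sum(mv[0] for mv in mvs) / n, sum(mv[1] for mv in mvs) / n)


def translate_block(reference, top, left, size, mv):
    """Copy the size x size block displaced by integer vector mv = (dx, dy).

    reference is a list of rows of pixel values; positions outside the frame
    are clamped to the nearest border pixel (an assumption).
    """
    dx, dy = mv
    height, width = len(reference), len(reference[0])
    block = []
    for y in range(top, top + size):
        row = []
        for x in range(left, left + size):
            ry = min(max(y + dy, 0), height - 1)
            rx = min(max(x + dx, 0), width - 1)
            row.append(reference[ry][rx])
        block.append(row)
    return block
```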

The area-dividing pattern deciding portion 234 determines a difference between the predicted video-frame outputted
from the variable-area predicted-frame generating portion 233 and the input video-frame and judges the necessity
of redividing the processable area to obtain a minimized error information amount.

The area-dividing pattern deciding portion 234 may use all or a part of the 26 patterns shown in Fig. 20 as selectable
area-dividing patterns. It may select a suitable pattern from a limited number of patterns, for example, 4 patterns Nos.
1, 3, 4 and 11, each of which includes triangles each having a hypotenuse directed from the upper right to the lower
left, or 4 patterns Nos. 2, 7, 9 and 26, each of which includes triangles each having a hypotenuse directed from the
upper left to the lower right, or 8 patterns that are a combination of the above-mentioned patterns Nos. 1, 2, 3, 5, 7, 9,
11 and 26.

It is also possible to use more finely dividing patterns (of more than 8 divisions) as selectable patterns. A unit
rectangular area to be divided may have a size other than 16 by 16 pixels, for example, 32 by 32 pixels or 8 by 8 pixels.

Furthermore, the area-dividing pattern deciding portion 234 selectively uses two kinds of predicted images, i.e., a
predicted image produced by affine transformation of divided areas and a predicted image generated by translational
displacement of rectangular areas by the predicted-frame generating portion 233. Accordingly, the area-dividing pattern
deciding portion 234 selects a pattern of dividing a processable area which can minimize the error information
value among 27 kinds of patterns, which include the 26 patterns shown in Fig. 20 and a pattern for translational
displacement prediction. After determining a suitable area-dividing pattern minimizing the error-information value, the
area-dividing pattern deciding portion 234 outputs a predicted video-frame to the subtracting portion 1211 and the adder
portion 1215 and outputs motion-vector information and area-dividing information to the side-information coding portion
235. The side-information coding portion 235 encodes the motion vectors and the area-dividing information received
from the area-dividing pattern deciding portion 234 and outputs the coded data as side-information.
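The choice among the candidate patterns can be sketched as a search for the smallest prediction error. The sum of absolute differences is used here as a stand-in for the "error information value", and the key 0 for the translational pattern is a hypothetical label; neither is specified in the description.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized pixel blocks."""
    return sum(abs(a - b)
               for row_a, row_b in zip(block_a, block_b)
               for a, b in zip(row_a, row_b))


def choose_pattern(input_block, candidates):
    """Return the pattern number whose predicted block has the smallest error.

    candidates maps a pattern number to the predicted block produced with that
    pattern (here 0 is assumed to denote translational displacement and
    1 to 26 the triangular divisions of Fig. 20).
    """
    return min(candidates, key=lambda number: sad(input_block, candidates[number]))
```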
The motion-compensated interframe predicting portion 1217 used in the video-coding device can output a series
of predicted video-frames and side-information by repeating the above-described operations.

Fig. 21 shows an exemplified structure of a motion-compensated interframe-predicting portion 223 for use in a
video-decoding device. In Fig. 21, numeral 237 designates a variable-area-size predicted-frame generating portion
and numeral 238 designates a side-information decoding portion.

Coded side-information inputted into the motion-compensated interframe-predicting portion 223 is transferred to
the side-information decoding portion 238.

The side-information decoding portion 238 decodes the received coded side-information, obtains basic vectors,
additional vectors and area-dividing information and outputs the decoded side-information to the variable-area-size
predicted-frame generating portion 237.

The variable-area-size predicted-frame generating portion 237 executes interframe prediction by using the motion
vectors inputted from the side-information decoding portion 238 and a reference preceding video-frame inputted from
a frame memory portion 224. At this time, the interframe prediction is conducted on each triangle into which an objective
area was divided according to the area-dividing information.

The interframe prediction is to determine affine parameters from the positions of three vertices of an objective triangular
area and their motion vectors and then to perform affine transformation on all pixels within the triangular area by using the
determined affine parameters. This processing is repeated on all objective triangular areas to generate a predicted
video-frame.

The variable-area-size predicted-frame generating portion 237 outputs a predicted video-frame generated by the
above-mentioned affine transformation or a predicted video-frame produced by translational displacement of
rectangular areas. The prediction by translational displacement is to map the location of a processable rectangular area
within a video-frame onto a reference video-frame and displace it by a representative motion-vector value, taking the
determined rectangular area on the reference video-frame as a predicted image for the processable rectangular area.

The variable-area-size predicted-frame generating portion 237 outputs the generated predicted video-frame to an adder
portion 1222.

The motion-compensated interframe predicting portion 1217 used in the video-coding device outputs a series of 
predicted video-frames by repeating the above-described operations. 

A motion vector searching system according to the present invention is as follows:

Fig. 22 is a view for explaining a control grid and motion vectors. Referring to Fig. 22, the setting of control grid points
is first explained as follows:

The present invention adopts a processable area of variable size, for which control grid points are set as shown in
Fig. 22.

In the instance of Fig. 22, control grid points are set for each unit area of 16 by 16 pixels and of 8 by 8 pixels.

In searching motion-vectors, weighting is conducted in the same way as in the conventional searching method. In
addition to this, the searching method according to the present invention uses effective-area selecting masks. Namely,
valid and invalid parts of each processable area for weighting can be specified by using the effective-area selecting
mask.

Fig. 23 shows all kinds of effective-area selecting masks, which are selectively used depending upon the locations of
control grid points existing within a video-frame.

For example, mask 1 is used for searching a motion vector at the upper left control grid point whose coordinates
are (0, 0).

Mask 2 is used for searching a motion vector at a control grid point of coordinates (8, 0), mask 3 for searching at
a point of coordinates (16, 0), mask 6 for searching at a point of coordinates (0, 8), mask 7 for searching at a point of
coordinates (8, 8), mask 8 for searching at a point of coordinates (8, 16), mask 12 for searching at a point of coordinates
(0, 16), and mask 13 for searching at a point of coordinates (8, 16). Similarly, masks 4, 5, 9, 10, 14, 16, 17, 18, 19, 20, 21,
22, 23, 24 and 25 are used at the left end and the bottom end of the video-frame.

As described above, searching of motion vectors according to the present invention is conducted using only a part 
of a processable area. 

Two kinds of motion vector coding methods will be described below. Fig. 22 shows the motion vectors in
a processable area:

(1) Method 1 

Basic motion vectors (MV1-4) are encoded by encoding a difference of each vector value from the motion vector encoded
just before it (this is the same as the conventional case).

Predicted values are determined from the basic motion vectors (MV1-4) as follows, and additional motion vectors (mv1-5)
are encoded by coding differences from the respective predicted values.

PMV1, PMV2, PMV3, PMV4 and PMV5 are predicted values of additional motion-vectors mv1, mv2, mv3, mv4 and
mv5 respectively and have the following respective expressions:

PMV1 = (MV1 + MV2) / 2
PMV2 = (MV1 + MV3) / 2
PMV3 = (MV1 + MV2 + MV3 + MV4) / 4
PMV4 = (MV2 + MV4) / 2
PMV5 = (MV3 + MV4) / 2

Differences between mv1, mv2, mv3, mv4 and mv5 and PMV1, PMV2, PMV3, PMV4 and PMV5 are determined and
encoded.

The differential values dmv1, dmv2, dmv3, dmv4 and dmv5 of the respective additional motion-vectors are
expressed as follows:

dmv1 = mv1 - PMV1
dmv2 = mv2 - PMV2
dmv3 = mv3 - PMV3
dmv4 = mv4 - PMV4
dmv5 = mv5 - PMV5
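The predictors and differences of Method 1 can be reproduced with a short sketch. Vectors are modeled as (x, y) tuples and the function names are illustrative; the rounding of half-pixel averages is not specified in the description, so plain division is used.

```python
def average(*vectors):
    """Component-wise average of the given (x, y) vectors."""
    return tuple(sum(component) / len(vectors) for component in zip(*vectors))


def predictors_method1(basic):
    """Return [PMV1, ..., PMV5] from the basic vectors [MV1, MV2, MV3, MV4]."""
    mv1, mv2, mv3, mv4 = basic
    return [
        average(mv1, mv2),            # PMV1
        average(mv1, mv3),            # PMV2
        average(mv1, mv2, mv3, mv4),  # PMV3
        average(mv2, mv4),            # PMV4
        average(mv3, mv4),            # PMV5
    ]


def encode_additional_method1(additional, basic):
    """Return [dmv1, ..., dmv5] with dmv_k = mv_k - PMV_k."""
    return [(mv[0] - pmv[0], mv[1] - pmv[1])
            for mv, pmv in zip(additional, predictors_method1(basic))]
```

When the additional vectors follow the motion implied by the neighboring basic vectors, most differences are zero or small, which is what makes them cheap to encode.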

(2) Method 2 



Basic motion vectors (MV1-4) are encoded by encoding a difference of each vector value from the motion vector encoded
just before it (this is the same as the conventional case).

An average of the 4 basic motion vectors (MV1-4) is taken as a predicted value, and differences of the respective additional
motion vectors (mv1-5) from the predicted value are determined and encoded.

PMV is a predicted value and has the following expression:

PMV = (MV1 + MV2 + MV3 + MV4) / 4

Differences of mv1, mv2, mv3, mv4 and mv5 from PMV are calculated and encoded respectively.

The differential values dmv1, dmv2, dmv3, dmv4 and dmv5 of the respective additional motion-vectors are
expressed as follows:

dmv1 = mv1 - PMV
dmv2 = mv2 - PMV
dmv3 = mv3 - PMV
dmv4 = mv4 - PMV
dmv5 = mv5 - PMV
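Method 2 uses a single predictor for all five additional vectors, which can be sketched as follows. The tuple representation and function names are illustrative assumptions.

```python
def predictor_method2(basic):
    """PMV: component-wise average of the four basic vectors [MV1, MV2, MV3, MV4]."""
    return (sum(mv[0] for mv in basic) / len(basic),
            sum(mv[1] for mv in basic) / len(basic))


def encode_additional_method2(additional, basic):
    """Return [dmv1, ..., dmv5] with dmv_k = mv_k - PMV."""
    pmv = predictor_method2(basic)
    return [(mv[0] - pmv[0], mv[1] - pmv[1]) for mv in additional]
```

Compared with Method 1, only one average has to be computed, at the price of larger differences for additional vectors that lie far from the center of the area.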

Two kinds of decoding methods for decoding basic motion vectors and additional motion vectors will be described
as follows:

(1) Method 1

Basic motion vectors (MV1 - MV4) are decoded by decoding a difference of each vector value from just before 
55 decoded motion vector (this is the same as the conventional case). 

Additional motion vectors (mv1-5) are decoded as differences from the respective predicted values determined
from the basic motion vectors (MV1-4). Each differential value is added to the corresponding predicted value.
The predicted values are set as follows:

PMV1, PMV2, PMV3, PMV4 and PMV5 are predicted values of the additional motion vectors mv1, mv2, mv3, mv4 and
mv5 respectively and are expressed as follows:

PMV1 = (MV1 + MV2) / 2
PMV2 = (MV1 + MV3) / 2
PMV3 = (MV1 + MV2 + MV3 + MV4) / 4
PMV4 = (MV2 + MV4) / 2
PMV5 = (MV3 + MV4) / 2

The differences between mv1, mv2, mv3, mv4 and mv5 and PMV1, PMV2, PMV3, PMV4 and PMV5 are decoded.

The additional motion vectors are reconstructed from the decoded differential values dmv1, dmv2, dmv3, dmv4 and
dmv5 as follows:

mv1 = PMV1 + dmv1
mv2 = PMV2 + dmv2
mv3 = PMV3 + dmv3
mv4 = PMV4 + dmv4
mv5 = PMV5 + dmv5

(2) Method 2 



Basic motion vectors (MV1-4) are decoded as differences from the respective motion vectors decoded immediately
before them (the same as in the conventional case).

Additional motion vectors (mv1-5) are decoded as differences from a predicted value determined from the basic
motion vectors (MV1-4), and each of the decoded differences is added to the predicted value.
The predicted value is the average of the 4 basic motion vectors, as in the case of encoding the motion vectors.

PMV is the predicted value and is expressed as follows:

PMV = (MV1 + MV2 + MV3 + MV4) / 4

The additional motion vectors are reconstructed from the decoded differential values dmv1, dmv2, dmv3, dmv4 and
dmv5 as follows:

mv1 = PMV + dmv1
mv2 = PMV + dmv2
mv3 = PMV + dmv3
mv4 = PMV + dmv4
mv5 = PMV + dmv5
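
As an illustration only, the two prediction methods above can be sketched in Python as follows. The function names, the tuple representation of a vector and the use of unrounded averages are assumptions made for this sketch; the description does not state how the averages are rounded to the motion-vector precision.

```python
from typing import List, Tuple

Vec = Tuple[float, float]  # (horizontal, vertical); representation assumed for this sketch


def mean(vectors: List[Vec]) -> Vec:
    """Component-wise average of a list of vectors (no rounding applied)."""
    n = len(vectors)
    return (sum(v[0] for v in vectors) / n, sum(v[1] for v in vectors) / n)


def predictors_method1(basic: List[Vec]) -> List[Vec]:
    """PMV1..PMV5 of Method 1, computed from the basic vectors MV1..MV4."""
    b1, b2, b3, b4 = basic
    return [
        mean([b1, b2]),          # PMV1 = (MV1 + MV2) / 2
        mean([b1, b3]),          # PMV2 = (MV1 + MV3) / 2
        mean([b1, b2, b3, b4]),  # PMV3 = (MV1 + MV2 + MV3 + MV4) / 4
        mean([b2, b4]),          # PMV4 = (MV2 + MV4) / 2
        mean([b3, b4]),          # PMV5 = (MV3 + MV4) / 2
    ]


def predictors_method2(basic: List[Vec]) -> List[Vec]:
    """Method 2: a single predictor PMV (average of MV1..MV4) for all five vectors."""
    return [mean(basic)] * 5


def encode_additional(additional: List[Vec], predictors: List[Vec]) -> List[Vec]:
    """Encoder side: dmv_i = mv_i - PMV_i."""
    return [(mv[0] - p[0], mv[1] - p[1]) for mv, p in zip(additional, predictors)]


def decode_additional(diffs: List[Vec], predictors: List[Vec]) -> List[Vec]:
    """Decoder side: mv_i = PMV_i + dmv_i."""
    return [(p[0] + d[0], p[1] + d[1]) for d, p in zip(diffs, predictors)]
```

Because the decoder rebuilds the predictors from the already decoded basic vectors, only the differences need to be transmitted. When the additional vectors lie close to the interpolated positions, Method 1 tends to give smaller differences than Method 2, at the cost of five predictors instead of one.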

Another method for coding motion vectors is described as follows:

The above-described coding method encodes the basic motion vectors and then encodes the additional motion vectors
by using information concerning the basic motion vectors, whereas the coding method explained below can
sequentially encode the necessary motion vectors without distinguishing between kinds of motion vectors.

Fig. 24(a) illustrates an example of dividing a processable area according to the present invention. Fig. 24(a)
also shows the positions of the motion vectors corresponding to control grid points within the divided processable
area. These motion vectors must all be encoded by the side-information coding portion 235, but the other motion
vectors are not encoded since they are not required for generating a predicted video-frame according to the
area-dividing pattern specified by the area-dividing pattern deciding portion 234. The video-coding device according
to the present invention does not necessarily encode the motion vectors at all control grid points. Namely, necessary
motion vectors and unnecessary motion vectors are identified by the result of dividing a video-frame area to be
coded. This allows only the necessary motion vectors to be selectively encoded.

Referring to Fig. 24(b), a motion-vector coding method, including how to determine predicted values of motion
vectors, is described as follows:

Motion vectors that exist on the left side, just above and on the upper right side of an objective motion-vector MV
are first determined for obtaining a predicted value for the objective motion-vector. The left-side motion vectors
MVL0 and MVL1 at two positions L0 and L1 are examined. When only one of the two motion vectors exists, that motion
vector is taken as the left motion vector. When both motion vectors exist, the one nearer to the objective motion
vector, i.e., MVL0, is selected as the left motion vector. A suitable value, e.g., (0, 0), is used as the left motion
vector value if no motion vector is found.

Similarly, the upper motion vector is determined as one of the motion vectors MVU0 and MVU1 existing at two
positions U0 and U1 respectively, and the upper right motion vector is determined as one of MVR0 and MVR1 existing
at two positions R0 and R1 respectively.

A predicted value of the objective motion-vector to be coded is determined from the three obtained motion vectors
(left, upper and upper right vectors). The predicted value can be calculated by any one of the conventional methods,
e.g., by calculating a mean value of the three motion vectors or a median of the three motion vectors. A difference
between the obtained predicted value and the objective motion-vector value is determined and encoded in the same way
as in the conventional method.

Fig. 24(c) shows an example of coding motion vectors in the area divided as shown in Fig. 24(b). As mentioned
above, a left motion-vector MV1 (nearer than MV2), an upper motion-vector MV3 and an upper-right motion-vector
MV4 are obtained for the objective motion-vector MV0.

A method for decoding coded motion vectors in the side-information decoding portion 238 of the video-decoding
device according to the present invention will be explained as follows:

Three already decoded motion-vectors that exist to the left of, above and to the upper right of an objective
motion-vector are determined in the same way as described for the case of encoding the motion vectors. These three
motion-vectors are identical to those obtained at the coding side. Accordingly, a predicted value of the objective
motion-vector, which is determined from said three motion-vectors, is identical to the predicted value obtained at
the coding side. In the video-decoding device, the side-information decoding portion 238 decodes the coded input
value representing a difference between the predicted value and the objective motion-vector value and adds the
decoded difference value to the predicted value, whereby the objective motion-vector value is reproduced.
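
A minimal Python sketch of the neighbor-based prediction described above is given here for illustration. The function names are hypothetical, `None` is used to mark a position where no motion vector exists, and the median is taken separately for each vector component, which is an assumption: the description only says "a median of three motion vectors".

```python
from typing import Optional, Tuple

Vec = Tuple[int, int]  # (horizontal, vertical); integer components assumed


def pick_candidate(near: Optional[Vec], far: Optional[Vec]) -> Vec:
    """Choose one vector from two candidate positions (e.g. L0 and L1).

    The nearer one is preferred, the farther one is used if it is the only
    one that exists, and (0, 0) is substituted when neither exists.
    """
    if near is not None:
        return near
    if far is not None:
        return far
    return (0, 0)


def median_predictor(left: Vec, upper: Vec, upper_right: Vec) -> Vec:
    """Component-wise median of the three neighboring vectors (assumed form)."""
    xs = sorted(v[0] for v in (left, upper, upper_right))
    ys = sorted(v[1] for v in (left, upper, upper_right))
    return (xs[1], ys[1])


def encode_vector(mv: Vec, predicted: Vec) -> Vec:
    """Coding side: difference between the objective vector and its prediction."""
    return (mv[0] - predicted[0], mv[1] - predicted[1])


def decode_vector(diff: Vec, predicted: Vec) -> Vec:
    """Decoding side: the same prediction plus the decoded difference."""
    return (predicted[0] + diff[0], predicted[1] + diff[1])
```

Both sides call `pick_candidate` and `median_predictor` on vectors that have already been coded or decoded, so the prediction is identical at the coder and the decoder and the objective vector is recovered exactly from the transmitted difference.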
Methods for coding and decoding area-dividing information are as follows: 

In the case of encoding the area-dividing information, each item of area-dividing information is described with
5 bits, since it may take a value from 1 to 26 (the number of area-dividing patterns).

However, variable-length coding can be used if the probability of occurrence of the patterns is non-uniformly
distributed. Namely, area-dividing patterns that occur at a high frequency are encoded with short codes and
area-dividing patterns that occur at a low frequency are encoded with long codes.

In the case of decoding the area-dividing information, the coded area-dividing information is decoded by a
variable-length decoding method.
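
The following Python sketch illustrates the two options. The fixed-length case reproduces the 5-bit figure for 26 patterns. For the variable-length case a Huffman code is used purely as one possible example; the description does not name a particular variable-length code, and the pattern frequencies used with this function are hypothetical.

```python
import heapq
from typing import Dict


def fixed_length_bits(num_patterns: int) -> int:
    """Smallest number of bits that can distinguish num_patterns values."""
    return max(1, (num_patterns - 1).bit_length())


def huffman_code_lengths(freqs: Dict[int, int]) -> Dict[int, int]:
    """Code length per pattern number for a Huffman code built from freqs.

    Huffman coding is an illustrative choice, not one named in the text.
    """
    # Heap entries: (total frequency, tie-breaker, pattern numbers in the subtree)
    heap = [(f, i, [sym]) for i, (sym, f) in enumerate(freqs.items())]
    heapq.heapify(heap)
    lengths = {sym: 0 for sym in freqs}
    counter = len(heap)
    while len(heap) > 1:
        f1, _, s1 = heapq.heappop(heap)
        f2, _, s2 = heapq.heappop(heap)
        for sym in s1 + s2:
            lengths[sym] += 1  # every merge adds one bit to the merged symbols
        heapq.heappush(heap, (f1 + f2, counter, s1 + s2))
        counter += 1
    return lengths
```

With a strongly skewed distribution the frequent patterns receive codes shorter than 5 bits and the rare ones longer codes, so the average length per area drops below the fixed-length cost.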

The operation of the area-dividing pattern deciding portion 234 and related portions will be described below. 
Fig. 25 is a flow chart representing a sequence of operations for determining the error-information amounts of all
kinds of area-dividing patterns and outputting the area-dividing pattern and motion vectors having the least amount
of error-information. Fig. 26 is a flow chart representing a sequence of operations for judging the necessity of
redividing a processable area by comparing the error-information amount with a threshold value.
The flow chart of Fig. 25 is first described as follows:

An effective area for searching motion vectors is selected according to the co-ordinates of control grid points,
and the motion vectors thereof are searched (Step 1). All basic motion vectors and additional motion vectors are
searched so that processing with all kinds of area-dividing patterns is possible.

Affine parameters on each triangle of each area are determined and affine transformation and pixel-value inter- 
polation are made for each area to generate a predicted video-frame (Step 2). 
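
As a sketch of how affine parameters may be obtained for one triangle, the following Python code solves for the six parameters from the three vertices and their motion vectors using Cramer's rule. The function names, the mapping direction (a point of the current frame is mapped to the point displaced by its motion vector in the reference frame) and the parameter layout are assumptions for illustration; pixel-value interpolation is omitted.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Affine = Tuple[float, float, float, float, float, float]  # (a, b, c, d, e, f)


def affine_from_triangle(points: List[Point], motion_vectors: List[Point]) -> Affine:
    """Parameters of x' = a*x + b*y + c, y' = d*x + e*y + f for one triangle.

    points are the three vertices in the current frame; each vertex is
    assumed to map to vertex + motion vector in the reference frame.
    """
    (x1, y1), (x2, y2), (x3, y3) = points
    det = x1 * (y2 - y3) - y1 * (x2 - x3) + (x2 * y3 - x3 * y2)
    if det == 0:
        raise ValueError("degenerate triangle: vertices are collinear")
    targets = [(x + dx, y + dy) for (x, y), (dx, dy) in zip(points, motion_vectors)]

    def solve(t1: float, t2: float, t3: float) -> Tuple[float, float, float]:
        # Cramer's rule for the system [x_i, y_i, 1] . (p, q, r) = t_i
        p = (t1 * (y2 - y3) + t2 * (y3 - y1) + t3 * (y1 - y2)) / det
        q = (t1 * (x3 - x2) + t2 * (x1 - x3) + t3 * (x2 - x1)) / det
        r = (t1 * (x2 * y3 - x3 * y2) + t2 * (x3 * y1 - x1 * y3)
             + t3 * (x1 * y2 - x2 * y1)) / det
        return p, q, r

    a, b, c = solve(*(t[0] for t in targets))
    d, e, f = solve(*(t[1] for t in targets))
    return (a, b, c, d, e, f)


def map_point(params: Affine, point: Point) -> Point:
    """Apply the affine transformation to one pixel position."""
    a, b, c, d, e, f = params
    x, y = point
    return (a * x + b * y + c, d * x + e * y + f)
```

When all three motion vectors are equal, the result reduces to a pure translation (a = e = 1, b = d = 0), which is the special case covered by the translational displacement prediction.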

At this time, a predicted video-frame produced by translational displacement of rectangular areas is outputted
together with the predicted video-frame produced by the affine transformation. The translational displacement
prediction is such that an objective rectangular area to be processed within a video-frame is displaced by its
representative motion-vector value and mapped onto a reference video-frame to determine a predicted area thereon.
The determined area of the reference video-frame is then outputted as a predicted image.

An error-information amount is calculated for each area-dividing pattern, the calculation results are compared with
each other, and the area-dividing pattern having the smallest amount of error-information is finally selected
(Step 3).

As described before, the number of selectable area-dividing patterns may be 26 (all kinds of patterns shown in
Fig. 20) or may be limited to a specified number of patterns, e.g., 4 patterns No. 1, 3, 5, 11, or 4 patterns
No. 2, 7, 9, 26, or 8 patterns No. 1, 2, 3, 5, 7, 9, 11, 26 (a combination of the above-mentioned two groups).

As described before, a predicted image generated by affine transforming a divided area and a predicted image 
generated by translational displacement of a rectangular area may be used. Namely, it is possible to use 27 kinds of 
area patterns including 26 patterns of Fig. 20 and a predicted translational displacement image. 

The number of the selected area-dividing pattern and the necessary motion vectors (basic and additional motion
vectors) are encoded, and the predicted video-frame and the coded side-information are outputted (Step 4).
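
Step 3 of the Fig. 25 procedure can be sketched in Python as follows. The sum of absolute differences is used here as the error-information amount, which is an assumption: the description does not define the error measure. The function names and the dictionary of candidate predictions are likewise hypothetical.

```python
from typing import Dict, List, Tuple

Block = List[List[int]]  # rows of pixel values


def sum_abs_diff(a: Block, b: Block) -> int:
    """Sum of absolute pixel differences, used here as the error amount."""
    return sum(abs(p - q) for row_a, row_b in zip(a, b) for p, q in zip(row_a, row_b))


def select_min_error_pattern(block: Block, candidates: Dict[int, Block]) -> Tuple[int, int]:
    """Return (pattern number, error) of the candidate with the least error.

    candidates maps each selectable area-dividing pattern number to the
    predicted block generated with that pattern.
    """
    if not candidates:
        raise ValueError("at least one candidate pattern is required")
    best_no, best_err = -1, -1
    for pattern_no in sorted(candidates):
        err = sum_abs_diff(block, candidates[pattern_no])
        if best_err < 0 or err < best_err:
            best_no, best_err = pattern_no, err
    return best_no, best_err
```

Every selectable pattern is evaluated once, so the cost grows with the number of patterns allowed (26, 27 with the translational prediction, or a limited subset).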
The flow chart of Fig. 26 is now explained as follows: 

An effective area for searching motion vectors is selected according to the co-ordinates of control grid points,
and the motion vectors thereof are searched (Step 1). At this time, either of area-dividing patterns No. 1 and No. 2
is specified. (It is also possible to specify both patterns No. 1 and No. 2 and select whichever has the smaller
amount of error-information as the initial pattern number.)

Affine parameters on each triangle of each area are determined and affine transformation and pixel-value inter- 
polation are made for each area to generate a predicted video-frame (Step 2). 

At this time, a predicted video-frame produced by translational displacement of rectangular areas is outputted
together with the predicted video-frame produced by the affine transformation. The translational displacement
prediction is such that an objective rectangular area to be processed within a video-frame is displaced by its
representative motion-vector value and mapped onto a reference video-frame to determine a predicted area thereon.
The determined area of the reference video-frame is then outputted as a predicted image.

An error-information amount is calculated for the current area-dividing pattern and the calculation result is
compared with a preset threshold value (Step 3). If the error-information amount exceeds the threshold value, the
area-dividing pattern is changed to another suitable one. The next area-dividing pattern number to be processed is
selected from all 26 kinds of patterns shown in Fig. 20 or from a limited number of area-dividing patterns, e.g.,
4 patterns No. 1, 3, 5, 11, or 4 patterns No. 2, 7, 9, 26, or 8 patterns No. 1, 2, 3, 5, 7, 9, 11, 26 (a combination
of the above-mentioned two groups).

An area-dividing pattern is adopted when its error-information amount does not exceed the threshold value
(Step 4).

The number of the selected area-dividing pattern and the necessary motion vectors (basic and additional motion
vectors) are encoded, and the predicted video-frame and the coded side-information are outputted (Step 5).
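
The threshold-based procedure of Fig. 26 can be sketched in Python as shown here. As before, the sum of absolute differences stands in for the error-information amount and all names are hypothetical. The fallback used when no pattern meets the threshold (taking the least-error pattern among those tried) is an assumption, since the description does not cover that case.

```python
from typing import Callable, List, Sequence, Tuple

Block = List[List[int]]  # rows of pixel values


def sum_abs_diff(a: Block, b: Block) -> int:
    """Sum of absolute pixel differences, used here as the error amount."""
    return sum(abs(p - q) for row_a, row_b in zip(a, b) for p, q in zip(row_a, row_b))


def select_pattern_with_threshold(
    block: Block,
    predict: Callable[[int], Block],
    order: Sequence[int],
    threshold: int,
) -> Tuple[int, int]:
    """Try patterns in the given order and stop at the first acceptable one.

    predict(pattern_no) returns the predicted block for that pattern.
    A pattern is accepted when its error does not exceed the threshold.
    """
    tried: List[Tuple[int, int]] = []
    for pattern_no in order:
        err = sum_abs_diff(block, predict(pattern_no))
        if err <= threshold:
            return pattern_no, err  # Step 4: adopt this pattern
        tried.append((pattern_no, err))
    if not tried:
        raise ValueError("no area-dividing pattern was supplied")
    # Assumed fallback: no pattern met the threshold, keep the best one tried.
    return min(tried, key=lambda item: item[1])
```

Compared with the exhaustive search of Fig. 25, prediction is generated only for the patterns actually visited, which is where the saving in calculation comes from when a coarse pattern is already good enough.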
As is apparent from the foregoing description, the present invention offers the following advantages: 

(1) This video-coding device is characterized in that it is provided with a variable-area predicted-frame generating
portion for adaptively dividing a processable area according to motion-vectors and a reference video-frame for
motion-compensated interframe prediction and generating a predicted video-frame by conducting affine
transformation, and an area-dividing pattern deciding portion for controlling the dividing of a processable area and
outputting a predicted video-frame and side-information such as motion-vector information and area-dividing
information. It can divide a processable area into a pattern optimal for the size and shape of the object to be
processed, reduce the error-information amount and effectively encode a sequence of video-frames.

(2) This video-coding device is characterized in that it is provided with a variable-area predicted-frame generating
portion for generating a predicted video-frame by parallel displacement of a processable area and generating a
predicted video-frame by affine transformation of a processable area adaptively divided according to motion-vectors
and a reference video-frame for motion-compensated interframe prediction, and an area-dividing pattern deciding
portion for controlling the dividing of a processable area and outputting a predicted video-frame and
side-information such as motion-vector information and area-dividing information. It can divide a processable
area into areas optimal for the size and shape of the object to be processed, prevent the decrease of prediction
efficiency due to geometrical deformation peculiar to affine transformation, accurately represent movement even when
an input video-image includes translational displacement of an area, and effectively encode a sequence
of video-frames with a reduced amount of error-information.

(3) This video-coding device is characterized in that it is provided with an effective-area selecting portion for
selecting a valid processing mask or an invalid processing mask depending upon the location of an objective area in
a video-frame when searching for a motion vector, and can more accurately search for motion vectors at control grid
points at the periphery of a video-frame.

(4) This video-coding device corresponds to the video-coding device mentioned above in (1) or (2) with the
motion-vector searching system mentioned above in (3), and can more accurately search for motion vectors at control
grid points at the periphery of a video-frame and can achieve more efficient motion-compensated interframe
prediction by using the motion vectors.

(5) This video-coding device has a side-information coding portion for encoding an additional motion vector as a
difference from a mean of 4 basic motion-vectors and can reduce the amount of codes used for encoding motion
vectors.

(6) This video-coding device corresponds to the video-coding device with the motion-vector coding system
mentioned above in (5) and can reduce the amount of codes used for encoding motion vectors, divide an objective area
optimally to the object shape, prevent the decrease of prediction efficiency due to geometrical deformation peculiar
to affine transformation, accurately represent the object movement even when an input video-image includes
translational displacement of an area, and effectively encode a sequence of video-frames with a reduced
amount of error-information.

(7) This video-coding device is characterized in that it is provided with a side-information coding portion for
encoding an additional motion vector positioned between two basic motion-vectors as a difference from a mean of the
two basic motion-vectors and encoding a center additional motion-vector as a difference from a mean of four basic
motion-vectors, and can reduce the amount of codes used for encoding motion vectors.

(8) This video-coding device has, in addition to the features mentioned above in (1) or (2), the motion-vector coding
system mentioned above in (7) and can reduce the amount of codes for encoding motion vectors when coding
side-information, optimally divide a processable area to meet the size and shape of the object to be processed,
prevent the decrease of prediction efficiency due to geometrical deformation peculiar to affine transformation,
accurately represent the object movement even when an input video-image includes translational displacement of an
area, and effectively encode a sequence of video-frames with a reduced amount of error-information.

(9) This video-coding device is provided with a side-information coding portion for encoding an objective
motion-vector by prediction from the values of three motion-vectors which are already encoded basic or additional
motion-vectors existing at the left side, just above and at the upper right side of the objective motion-vector. It
can effectively encode motion vectors, even though they are dispersively located in a video-frame, by using the
two-dimensional correlation of motion vectors, thereby effectively reducing the amount of codes used for coding the
motion vectors.

(10) This video-coding device, having the features mentioned above in (9), is capable of determining a mean of the
three motion vectors as a predicted value of the objective motion-vector and encoding a difference between the
objective motion-vector to be coded and the predicted value, and can effectively reduce the amount of codes necessary
for coding motion vectors by effectively using a specified correlation of an objective motion vector with the
surrounding motion vectors.

(11) This video-coding device has the features mentioned above (9) and is capable of determining a median of 
the three motion vectors as a predicted value of the objective motion-vector and encoding a difference between 
the objective motion-vector and the predicted value and can effectively save the amount of codes necessary for 
coding motion vectors by prediction encoding of an objective motion vector by using a motion vector value having 
a high correlation with the motion vectors surrounding the objective motion vector. 

(12) This video-coding device has the features mentioned above in (1) or (2) and the motion-vector coding portion
defined in any one of claims 9, 10 and 11 and can effectively encode motion vectors with a reduced amount of codes,
even though they are dispersively located in a video-frame, by using the two-dimensional correlation of motion
vectors when coding side-information, optimally divide a processable area to meet the size and shape of the object to
be processed, prevent the decrease of prediction efficiency due to geometrical deformation peculiar to affine
transformation, accurately represent the object movement even when an input video-image includes translational
displacement of an area, and effectively encode a sequence of video-frames with a reduced amount of
error-information.

(13) This video-coding device has the features mentioned above in any one of (1), (2), (4), (6), (8) and (12) and is
further provided with an area-dividing pattern deciding portion for indicating the kinds of dividing patterns of all
areas to the variable-area predicted-frame generating portion, adopting the area-dividing pattern minimizing the
prediction error (error-information value), outputting motion-vectors and area-dividing information to the
side-information coding portion and outputting a predicted video-frame. Accordingly, it can determine and output an
optimal area-dividing pattern minimizing the error-information amount in a single pass from all selectable patterns
including translational displacement.

(14) This video-coding device has the features mentioned above in any one of (1), (2), (4), (6), (8) and (12) and is
further provided with an area-dividing pattern deciding portion which instructs the variable-area predicted-frame
generating portion to divide a processable area into two small areas as an initial setting, instructs the
variable-area predicted-frame generating portion to divide the area again more finely if the prediction error
(error-information value) of a predicted frame generated with the initial setting exceeds a preset threshold value,
outputs motion-vectors and area-dividing information to the side-information coding portion when the prediction
error (error-information value) becomes smaller than the preset threshold value, and also outputs a predicted
video-frame. It can, therefore, determine and output a suitable area-dividing pattern with a relatively reduced
amount of calculation.

(15) This video-coding device has the features mentioned above in any one of (1), (2), (4), (13) and (14) and is
further provided with: a motion-compensated interframe-predicting portion for generating a predicted video-frame
from an input video-frame and a reference video-frame received from a frame memory, outputting the predicted
video-frame to a subtracting portion and an adder portion and outputting side-information; the subtracting portion
for determining a difference between the input video-frame and the predicted video-frame received from the
motion-compensated interframe-predicting portion and outputting an error (differential) video-frame to a
video-coding portion; the video-coding portion for encoding the error (differential) video-frame from the
subtracting portion according to an instruction of the coding control portion and outputting coded video
information to a video-decoding portion; the video-decoding portion for decoding the coded video information
received from the video-coding portion and outputting a decoded video-frame to the adder portion; the adder portion
for adding the predicted video-frame received from the motion-compensated interframe-predicting portion to the
decoded video-frame received from the video-decoding portion and outputting the obtained reference video-frame to
the frame memory; and the frame memory for storing the reference video-frame received from the adder portion and
outputting the stored reference video-frame to the motion-compensated interframe-predicting portion. Accordingly,
this device can more accurately search for motion-vectors, optimally divide a processable area to the object size and
shape, prevent the decrease of prediction efficiency due to geometrical deformation peculiar to affine
transformation, accurately represent the object movement even when an input video-image includes translational
displacement of an area, with a reduced amount of error-information, and effectively encode a sequence of
video-frames with a reduced amount of codes for coding motion vectors.

(16) This video-decoding device for reproducing a video-frame from coded side-information inputted from a
video-coding device and a reference video-frame inputted from a frame memory is characterized in that a
motion-compensated interframe-predicting portion for generating and outputting a predicted video-frame by changing
the size of the area to be coded is provided with a side-information decoding portion for decoding the coded
side-information received from the video-coding device and thereby obtaining motion-vectors and area-dividing
information, and a variable-area predicted video-frame generating portion for generating a predicted video-frame by
using the motion vectors and area-dividing information from the side-information decoding portion and a reference
video-frame from the frame memory. Accordingly, this device can optimally divide a processable area to the object
size and shape, accurately represent the object movement even when an input video-image includes translational
displacement of an area, and effectively decode a sequence of video-frames with a reduced amount of
error-information.

(17) This video-decoding device can effectively decode coded motion vectors by using its side-information decoding
portion for decoding four basic motion-vectors contained in the side-information before decoding motion-vectors
and decoding additional motion-vectors representing a difference from a mean of the basic vectors.

(18) This video-decoding device has the features mentioned above in (16) and the side-information decoding system
mentioned above in (17) and can effectively decode coded motion vectors, optimally divide a processable area to
the object size and shape, accurately represent the object movement even when an input video-image includes parallel
displacement of an area, and effectively decode a sequence of video-frames with a reduced
amount of error-information.

(19) This video-decoding device can effectively decode coded motion vectors by using its side-information decoding
portion for decoding four basic motion-vectors contained in the side-information before decoding motion-vectors
and for decoding an additional motion-vector positioned between two basic motion-vectors as a difference from
an obtained mean of the two basic motion-vectors and decoding a center additional motion-vector as a difference
from an obtained mean of four basic motion-vectors.

(20) This video-decoding device has the features mentioned above in (16) and the side-information decoding system
defined in claim 19 and can effectively decode coded motion vectors, optimally divide a processable area to the
object size and shape, accurately represent the object movement even when an input video-image includes translational
displacement of an area, and effectively decode a sequence of video-frames with a reduced
amount of error-information.

(21) This video-decoding device is provided with a side-information decoding portion which decodes an objective
motion-vector value by prediction from the values of three motion-vectors which are already decoded basic or
additional motion-vectors existing at the left side, just above and at the upper right side of the objective
motion-vector. Accordingly, it can effectively decode vectors which were dispersively located in a video-frame and
encoded with a reduced amount of codes by using the two-dimensional correlation of motion vectors.

(22) This video-decoding device has the features mentioned above (21) and is provided with a side-information 
decoding portion that determines a mean of the three motion-vectors as a predicted value of the objective motion- 
vector and obtains the decoded objective motion-vector value by adding the decoded difference value to the pre- 
dicted value. Accordingly, it can effectively decode vectors which were encoded by effectively using a specified 
correlation of an objective motion vector with surrounding motion vectors. 

(23) This video-decoding device has the features mentioned above in (21) and is provided with a side-information
decoding portion for determining a median of the three motion-vectors as a predicted value of the objective
motion-vector and obtaining a decoded objective motion-vector value by adding the decoded difference value to the
predicted value. It can effectively decode the objective motion vector that was predictively encoded by using a
motion vector value having a high correlation with the motion vectors surrounding the objective motion vector.

(24) This video-decoding device has the features mentioned above in (16) and is provided with the motion-vector
decoding system mentioned above in any one of (21), (22) and (23). It can effectively decode motion vectors that
were dispersively located in a video-frame and encoded by using the two-dimensional correlation of motion vectors,
optimally divide a processable area to meet the size and shape of the object to be processed, accurately represent
the object movement even when an input video-image includes parallel displacement of an area, and
effectively decode a sequence of video-frames with a reduced amount of error-information.

(25) This video-decoding device has the features mentioned above in any one of (16), (18), (20) and (24) and is
further provided with a video-decoding portion for decoding coded video information inputted from a video-coding
device and outputting the decoded video information, an adder portion for adding a decoded video-frame inputted
from the video-decoding portion to a predicted video-frame inputted from the motion-compensated interframe
predicting portion and outputting the obtained video-frame as an output video-frame, and a frame memory for storing
a video-frame from the adder portion and for outputting said stored video-frame as a reference video-frame to the
motion-compensated interframe predicting portion. Accordingly, it can effectively decode
coded motion vectors and coded areas optimally divided to the object size and shape, accurately represent the
object movement even when an input video-image includes translational displacement of an area,
and effectively decode a sequence of video-frames with a reduced amount of error-information.

Referring to the accompanying drawings, preferred embodiments of the present invention will be described as follows.

The general operation of the video-coding and video-decoding devices according to the present invention is the same
as described before for the conventional video-coding and video-decoding devices shown in Figs. 5 and 6.

Fig. 5 illustrates an exemplified structure of the conventional video-coding device and Fig. 6 illustrates an exem- 
plified structure of the conventional decoding device. 

Accordingly, the operation of a motion-compensated interframe predicting portion of a video-coding device accord- 
ing to the present invention and the operation of a motion-compensated interframe predicting portion of a video-de- 
coding device according to the present invention will be explained below. 

Referring to Fig. 27, a motion-compensated interframe-predicting portion 317 for use in a video-coding device
which embodies the present invention is described first as follows:

In Fig. 27, the motion-compensated interframe-predicting portion 317 for use in a video-coding device comprises a
motion-vector searching portion 331, predicting portions 332a, 332b, ... 332n, an area prediction deciding portion
333 and a side-information coding portion 334.

The motion-vector searching portion 331 searches for motion in the input video-frame with reference to a reference
video-frame received from the frame memory portion 316 and outputs the found motion vectors to the predicting
portions No. 1 - n (332a - 332n).

The predicting portions No. 1 - n (332a - 332n) which are different from each other by predicting methods generate 
respective predictive images by executing different kinds (No. 1 - n) of motion-compensated interframe prediction by 
using the motion vectors received from the motion-vector searching portion 331 and the reference video-frame received 
from the frame memory portion 316. 

At this time, each of the predicting portions No. 1 - n (332a - 332n) divides the received reference video-frame into
one or more small areas and performs interframe prediction processing on each of the small areas, which will
be hereinafter referred to as processable or processed small areas.

Each of the predicting portions No. 1 - n (332a - 332n) transfers the produced predicted images (for one or more 
small processable areas) and the motion-vectors used for interframe prediction to the area prediction deciding portion 
333. 

The area prediction deciding portion 333 determines a difference of the input video-frame from each of the
predictive images (No. 1 - n) received from the predicting portions No. 1 - n (332a - 332n), selects the optimal one
(with a minimal error) of the differently predicted images for each small processable area, and produces a predicted
video-frame composed of the selected predictive small-area images having the minimized error. The area prediction
deciding portion 333 outputs the composed image as a predicted video-frame and transfers the side-information,
including motion vectors, area-information and prediction-mode information, etc., to the side-information coding
portion 334.

The side-information coding portion 334 encodes the side-information (motion vectors, area-information,
prediction-mode information) received from the area prediction deciding portion 333 and outputs the coded side-information.
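The selection performed by the area prediction deciding portion 333 can be sketched as follows. This is only a minimal illustration, not the implementation of the invention: the error measure (sum of absolute differences), the function names sad and decide_area_prediction, the mode names and the sample pixel values are all assumptions introduced here.

```python
def sad(a, b):
    """Sum of absolute differences between two equally sized 2-D blocks (assumed error measure)."""
    return sum(abs(x - y) for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))


def decide_area_prediction(input_areas, candidates):
    """For each small area, pick the candidate prediction with the minimal error.

    input_areas: list of 2-D blocks taken from the input video-frame.
    candidates:  dict mapping a (hypothetical) mode name to a list of predicted
                 blocks, one per small area, in the same order as input_areas.
    Returns (selected_blocks, modes); modes plays the role of the
    prediction-mode information in the side-information.
    """
    selected, modes = [], []
    for i, target in enumerate(input_areas):
        best_mode = min(candidates, key=lambda m: sad(target, candidates[m][i]))
        selected.append(candidates[best_mode][i])
        modes.append(best_mode)
    return selected, modes


# Two small areas, two predicting portions (illustrative values only).
input_areas = [[[10, 10], [10, 10]], [[50, 60], [70, 80]]]
candidates = {
    "block_displacement": [[[10, 10], [10, 11]], [[40, 40], [40, 40]]],
    "affine": [[[12, 12], [12, 12]], [[50, 61], [70, 79]]],
}
selected, modes = decide_area_prediction(input_areas, candidates)
print(modes)  # ['block_displacement', 'affine']
```

In this sketch the first area is best predicted by block displacement and the second by the affine candidate, so the composed predicted frame mixes both methods and the mode list is what would be passed on for coding.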

Fig. 28 shows an exemplified structure of a motion-compensated interframe-predicting portion 323 for use in a
video-decoding device. In Fig. 28, numeral 335 denotes a side-information decoding portion, numerals 336a, 336b ...
336n denote predicting portions 1, 2 ... n respectively and numeral 337 denotes an area prediction selecting portion.

Coded side-information inputted into the motion-compensated interframe-predicting portion 323 is transferred to 
the side-information decoding portion 335. 

The side-information decoding portion 335 decodes the received coded side-information, obtains motion vectors,
area-information and prediction modes, supplies the motion vectors to the predicting portions No. 1 - n (336a -
336n) and outputs the area-information and prediction-mode information to the area prediction selecting portion 337.

The predicting portions No. 1 - n (336a - 336n), which differ from one another in predicting method, generate
respective predictive images by executing different kinds (No. 1 - n) of motion-compensated interframe prediction,
using the motion vectors received from the side-information decoding portion 335 and the reference video-frame
received from the frame memory portion 316. At this time, each of the predicting portions No. 1 - n (336a - 336n) divides
the received reference video-frame into one or more small areas and performs the interframe-prediction processing
operation on each of the small areas.

Each of the predicting portions No. 1 - n (336a - 336n) transfers the produced predicted images (for one or more 
small processable areas) to the area prediction selecting portion 337. 
The area prediction selecting portion 337 generates a predicted video-frame by selecting among the predicted images
No. 1 - n received from the predicting portions No. 1 - n (336a - 336n) according to the area-information and
prediction-mode information received from the side-information decoding portion 335. The area prediction selecting portion 337
outputs the thus obtained predicted video-frame.
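The composition step carried out by the area prediction selecting portion 337 can be illustrated with the following sketch. It assumes, purely for simplicity, that the decoded area-information describes rectangular areas as (top, left, height, width) and that each predicting portion delivers a full-size predicted image; the function name compose_predicted_frame and the mode names are hypothetical.

```python
def compose_predicted_frame(height, width, areas, modes, predictions):
    """Assemble a predicted frame from per-mode predicted images.

    areas:       list of (top, left, h, w) rectangles (assumed form of the area-information).
    modes:       list of mode names, one per area (decoded prediction-mode information).
    predictions: dict mapping a mode name to a full-size predicted image.
    """
    frame = [[0] * width for _ in range(height)]
    for (top, left, h, w), mode in zip(areas, modes):
        source = predictions[mode]
        # Copy only the pixels of this area from the image of the selected mode.
        for y in range(top, top + h):
            for x in range(left, left + w):
                frame[y][x] = source[y][x]
    return frame


# A 2 x 4 frame split into two 2 x 2 areas, each taken from a different predictor.
predictions = {
    "block_displacement": [[1, 1, 1, 1], [1, 1, 1, 1]],
    "affine": [[2, 2, 2, 2], [2, 2, 2, 2]],
}
areas = [(0, 0, 2, 2), (0, 2, 2, 2)]
area_modes = ["block_displacement", "affine"]
frame = compose_predicted_frame(2, 4, areas, area_modes, predictions)
print(frame)  # [[1, 1, 2, 2], [1, 1, 2, 2]]
```

No error computation is needed on the decoding side in this sketch, because the choice of method for each area arrives in the side-information.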

The predicting portion is described as follows:
Fig. 29 shows an example of selecting a prediction method.

Any conventional prediction system uses only one prediction method for motion-compensated interframe prediction
of a given area of an image. It therefore tends to divide the area finely, which increases the amount of
side-information (e.g., motion vectors), as shown in Fig. 29.

The prediction system according to the present invention divides an area of an image into small areas and applies
different kinds of prediction methods to the small areas for motion-compensated interframe prediction.

As shown, for example, in part (A) of Fig. 29, a small area 1 (netted) and a small area 2 consisting of two triangles
are processed by different kinds of motion-compensated interframe prediction, i.e., the small area
1 is processed by block displacement and the small area 2 is processed by affine transformation. In this instance, the
small area 1 uses an upper left motion vector.
As is apparent from Fig. 29, applying different kinds of motion-compensated interframe-prediction methods
can reduce the number of motion vectors that must be used from 7 (part (A) in Fig. 29) to 4 (part (B) in Fig. 29).

In the case shown, the predicting portion 1 (332a) divides a unit-area of 16 by 16 pixels into small areas and
processes each small area by the block-displacement method for motion-compensated interframe prediction, while the
predicting portion 2 (332b) divides the unit-area into small areas and processes each small area by the affine
transformation method for motion-compensated interframe prediction.

As described above, the present invention makes it possible to process not only unit-size areas but also areas of different
sizes by applying different kinds of methods for motion-compensated interframe prediction, e.g., an overlapped
motion-compensative method, a bilinear transformation method and a background prediction method, in addition to the
above-mentioned block-displacement and affine transformation methods.
A predicting portion that works by the affine-transformation method is described as follows:

Fig. 30 shows an example of dividing an area in the case of conducting the motion-compensated interframe
prediction by using the affine transformation technique. In the case of motion-compensated interframe prediction by the
affine transformation, a unit area of 16 by 16 pixels is divided into a plurality of triangles to be processed. Fig. 31 shows
examples of dividing a unit-area of 16 by 16 pixels into a plurality of triangular areas.

In conducting motion-compensated interframe prediction of an object shown in Fig. 30 by using the affine
transformation, the area can be divided into 5 triangles as shown in part (A) of Fig. 30, or it can be diagonally divided into 4
triangles as shown in part (B) of Fig. 30. As is apparent from the two figures, the number of motion vectors necessary for
prediction processing can be reduced from 7 in the former usual case to 5 in the latter case.
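As a rough illustration of affine prediction on one triangular area, the sketch below derives the motion vector of any pixel in a triangle from motion vectors given at its three vertices. It assumes that the vertices carry the motion vectors, which is consistent with the vector counts quoted above but is stated here as an assumption; the barycentric formulation, the function name affine_motion_vector and the sample values are illustrative only.

```python
def affine_motion_vector(p, tri, vectors):
    """Motion vector at point p of triangle tri, interpolated affinely
    from the motion vectors at the three vertices (barycentric form).

    p:       (x, y) pixel position.
    tri:     three vertex positions [(x1, y1), (x2, y2), (x3, y3)].
    vectors: three motion vectors [(u1, v1), (u2, v2), (u3, v3)].
    """
    (x1, y1), (x2, y2), (x3, y3) = tri
    x, y = p
    det = (y2 - y3) * (x1 - x3) + (x3 - x2) * (y1 - y3)
    w1 = ((y2 - y3) * (x - x3) + (x3 - x2) * (y - y3)) / det
    w2 = ((y3 - y1) * (x - x3) + (x1 - x3) * (y - y3)) / det
    w3 = 1.0 - w1 - w2
    u = w1 * vectors[0][0] + w2 * vectors[1][0] + w3 * vectors[2][0]
    v = w1 * vectors[0][1] + w2 * vectors[1][1] + w3 * vectors[2][1]
    return (u, v)


# Upper-left triangle of a 16 x 16 unit area with assumed vertex vectors.
tri = [(0, 0), (16, 0), (0, 16)]
vectors = [(0, 0), (4, 0), (0, 8)]
print(affine_motion_vector((16, 0), tri, vectors))  # (4.0, 0.0): reproduces the vertex vector
print(affine_motion_vector((4, 4), tri, vectors))   # (1.0, 2.0)
```

Because neighbouring triangles share vertices, the number of vectors to be transmitted equals the number of distinct vertices: a diagonal division into four triangles uses the four corners and the centre, i.e. five vectors.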

Even when an area is divided diagonally into a small number of subareas, e.g., four areas (in pattern
27 of Fig. 31) or two areas (in pattern 1 or 2 of Fig. 31), a sufficiently high efficiency of video-frame prediction with a
reduced amount of side-information (e.g., with a reduced number of motion vectors) can be achieved by generating
a plurality of predictive images by applying different prediction methods to each of the small areas.
As is apparent from the foregoing description, the present invention offers the following advantageous effects:

(1) A video coding device comprises a predicting portion for generating a plurality of predictive images of small
variable-size areas by applying different predicting methods to each of the small variable-size areas in the process
of the motion-compensated interframe prediction and outputting the generated predictive images, an area prediction
deciding portion for determining an adaptive area size and an adaptive prediction method according to a
plurality of the predictive images received from the predicting portion and outputting side-information composed
of area-information, prediction-mode information, motion-vectors and so on, and a side-information coding portion
for encoding the side-information outputted from the area prediction deciding portion. Accordingly, this device can
selectively use, among a plurality of prediction methods for motion-compensated interframe prediction, an adaptive
one to minimize the amount of side-information or prediction error value.

(2) A video coding device comprises a predicting portion for generating a plurality of predictive images of respective
unit areas or further-divided subareas by applying different predicting methods to each of the unit areas or each
of the subareas in the process of the motion-compensated interframe prediction and then outputting the generated
predictive images, an area prediction deciding portion for determining area-information indicating whether the
predicted images received from the predicting portion are unit-area images or subarea images and the prediction
methods applied to the respective predicted images and outputting side-information including area-information,
prediction-mode information, motion-vectors and the like, and a side-information coding portion for encoding the
side-information received from the area prediction deciding portion. Accordingly, the device can selectively use,
among a plurality of prediction methods for motion-compensated interframe prediction, an adaptive one to minimize
the amount of side-information or prediction error value.

(3) A video coding device comprises a predicting portion for generating a plurality of predictive images of respective
variable-size small areas by using a block-displacement (overlapped motion-compensative) predicting method, an
affine transform predicting method, a bilinear transform predicting method and a background predicting method, etc.
in the process of the motion-compensated interframe prediction and then outputting the generated predictive
images, an area prediction deciding portion for determining an adaptive area-size and an adaptive prediction method
from the predictive images received from the predicting portion and outputting side-information including
area-information, prediction-mode information and motion-vectors, etc., and a side-information coding portion for
encoding the side-information received from the area prediction deciding portion. This device, therefore, can
selectively use a plurality of prediction methods for motion-compensated interframe prediction coding so as to minimize
the amount of side-information or prediction error value.

(4) A video coding device is provided with a predicting portion for diagonally dividing each encodable unit-area
into two or four subareas, producing a predictive image for each subarea by affine transformation in the process
of the motion-compensated interframe prediction and outputting the produced predicted images, and, therefore,
can reduce the amount of the side-information (motion vectors and so on) or the prediction error value in accordance
with the shape, size and location of an object in the video-frame. This could not be realized by the conventional
system.

(5) A video coding device has the same functions as mentioned in any one of items (1), (2) and (3) above and is
provided with a predicting portion for diagonally dividing each encodable unit-area into two or four subareas,
producing a predictive image for each subarea by affine transformation in the process of the motion-compensated
interframe prediction and outputting the produced predicted images. Accordingly, the device can selectively use
a plurality of prediction methods for motion-compensated interframe prediction coding so as to minimize the amount
of side-information or prediction error value. It can also reduce the amount of the side-information (motion vectors
and so on) or the prediction error value in accordance with the shape, size and location of an object in the
video-frame. This could not be realized by the conventional system.

(6) A video decoding device comprises a side-information decoding portion for decoding coded side-information
including area-information, prediction-mode information and motion-vectors, etc. and outputting the decoded
side-information, a predicting portion for generating a plurality of predictive images of small variable-size areas by
applying different predicting methods to each of the small variable-size areas in the process of the
motion-compensated interframe prediction and outputting the generated predictive images, and an area-prediction-mode
selecting portion for generating an adaptive predictive image from a plurality of the predictive images received from
the predicting portion according to the information received from the side-information decoding portion. Accordingly,
this device can selectively use, among a plurality of prediction methods for motion-compensated interframe
prediction, an adaptive one to minimize the amount of side-information or prediction error value.

(7) A video decoding device comprises a side-information decoding portion for decoding coded side-information
including area-information, prediction-mode information and motion-vectors, etc. and outputting the decoded
side-information, a predicting portion for generating a plurality of predictive images of respective decodable unit-areas
or further-divided subareas by applying different predicting methods to each of the unit areas or each of the
subareas in the process of the motion-compensated interframe prediction and then outputting the generated predictive
images, and an area-prediction-mode selecting portion for generating an adaptive predictive image from a plurality
of the predictive images received from the predicting portion according to the information received from the
side-information decoding portion. Accordingly, this device can selectively use, among a plurality of prediction methods
for motion-compensated interframe prediction, an adaptive one to minimize the amount of side-information or
prediction error value.

(8) A video decoding device comprises a side-information decoding portion for decoding coded side-information
including area-information, prediction-mode information and motion-vectors, etc. and outputting the decoded
side-information, a predicting portion for generating a plurality of predictive images of respective decodable unit-areas
or further-divided subareas by using a block-displacement (overlapped motion-compensative) predicting method,
an affine transform predicting method, a bilinear transform predicting method and a background predicting method,
etc., in the process of the motion-compensated interframe prediction and then outputting the generated predictive
images, and an area-prediction-mode selecting portion for generating an adaptive predictive image from a plurality
of the predictive images received from the predicting portion according to the information received from the
side-information decoding portion. Accordingly, this device can selectively use, among a plurality of prediction methods
for motion-compensated interframe prediction, an adaptive one to minimize the amount of side-information or
prediction error value.

(9) A video decoding device is provided with a predicting portion for diagonally dividing each decodable unit-area
into two or four subareas, producing a predictive image for each subarea by affine transformation in the process
of the motion-compensated interframe prediction and outputting the produced predicted images and, therefore,
can reduce the amount of the side-information (motion vectors and so on) or the prediction error value in accordance
with the shape, size and location of an object in the video-frame. This could not be realized by the conventional
system.

(10) A video decoding device has the same functions as mentioned in any one of items (7) and (8) above and is
provided with a predicting portion for diagonally dividing each decodable unit-area into two or four subareas,
producing a predictive image for each subarea by affine transformation in the process of the motion-compensated
interframe prediction and then outputting the produced predicted images. Accordingly, the device can selectively
use a plurality of prediction methods for motion-compensated interframe prediction coding so as to minimize the
amount of side-information or prediction error value. It can also reduce the amount of the side-information (motion
vectors and so on) or the prediction error value in accordance with the shape, size and location of an object in the
video-frame. This could not be realized by the conventional system.



Claims 

1. A video coding device comprising frame memory means (111) for storing already encoded video signals, motion vector
detecting means (112) for determining a representative motion vector per unit area from a difference between an
input video signal and a video signal read from the frame memory means (111), motion vector interpolating means
(114) for determining a motion vector per pixel from the representative motion vector, pixel value predicting means
(115) for preparing a predicted video signal from a video signal read from the frame memory means (111) by using
the motion vector per pixel, characterized in that weighting coefficient control means (113) for instructing the motion vector
interpolating means (114) to weight a representative motion vector is provided for determining a weighting
coefficient for each representative motion vector determined by the motion vector detecting means (112) and instructing
the motion vector interpolating means (114) to weight each representative motion vector.



2. A video-coding device as defined in claim 1, characterized in that the weighting coefficient control means (113)
selects one of previously prepared patterns of weighting coefficients for representative vectors and instructs the
motion vector interpolating means (114).

3. A video-coding device as defined in any one of claims 1 and 2, characterized in that the weighting coefficient control
means (113) determines a vector weighting coefficient according to a direction of each representative motion vector.

4. A video-coding device as defined in any one of claims 1 and 2, characterized in that the weighting coefficient control
means (113) determines a vector weighting coefficient according to a vector value of each representative motion
vector.

5. A video decoding device comprising frame memory means (121, 181) for storing already decoded video signals, motion vector
interpolating means (123, 184) for determining a motion vector per pixel from a representative motion vector
inputted per unit area, pixel value predicting means (124, 185) for preparing a predicted video signal from a video
signal read from the frame memory means (121, 181) by using the motion vector per pixel, characterized in that
weighting coefficient control means (122, 183) for instructing the motion vector interpolating means (123, 184) to
weight a representative motion vector is provided for determining a weighting coefficient for each representative
motion vector and instructing the motion vector interpolating means (123, 184) to weight each representative motion
vector.

6. A video-decoding device as defined in claim 5, characterized in that the weighting coefficient control means (122,
183) selects one of previously prepared patterns of weighting coefficients for representative vectors and instructs
the motion vector interpolating means (123, 184).

7. A video-decoding device as defined in any one of claims 5 and 6, characterized in that motion vector converting
means (182) is provided for determining a motion vector for a skipped frame from a representative motion vector
inputted for a unit area of an encoded image frame and an interpolating video signal corresponding to an image frame
skipped without being encoded is prepared.

8. A video coding device for encoding predicted error information determined as a difference between an input
video-frame and a predicted video-frame obtained by executing motion-compensated interframe prediction, characterized
in that a motion-compensated interframe-predicting portion (217, 1217) is provided with a variable-area
predicted-frame generating portion (233) for dividing a processable area of the input video-frame into suitable areas
according to motion vectors and a reference video-frame and generating a predicted video-frame by affine
transformation and an area-dividing pattern deciding portion (234) for controlling dividing of a processable area and
outputting a predicted video-frame and side-information such as motion vectors and area-dividing information.

9. A video coding device for encoding predicted error information determined as a difference between an input
video-frame and a predicted video-frame obtained by executing motion-compensated interframe prediction, characterized
in that a motion-compensated interframe-predicting portion (217, 1217) is provided with a variable-area
predicted-frame generating portion (233) for dividing a processable area of the input video-frame into suitable areas
according to motion vectors and a reference video-frame and generating a predicted video-frame by affine
transformation and a predicted video-frame by translational displacement of processable areas and an area-dividing
pattern deciding portion (234) for controlling dividing of a processable area and outputting a predicted video-frame
and side-information such as motion vectors and area-dividing information.

10. A video coding device for encoding predicted error information determined as a difference between an input
video-frame and a predicted video-frame obtained by executing motion-compensated interframe prediction, characterized
in that a motion-compensated interframe-predicting portion (217, 1217) is provided with an effective-area
selecting portion (231) for selecting a valid processing mask or an invalid processing mask depending upon location
of an objective area in a video-frame when searching a motion vector.

11. A video coding device as defined in any one of claims 8 and 9, which encodes predicted error information
determined as a difference between an input video-frame and a predicted video-frame obtained by executing
motion-compensated interframe prediction, characterized in that the motion-compensated interframe-predicting portion
(217, 1217) is provided with an effective-area selecting portion (231) for selecting a valid processing mask or an
invalid processing mask depending upon location of an objective area in a video-frame when searching a motion
vector.



12. A video coding device for encoding predicted error information determined as a difference between an input
video-frame and a predicted video-frame obtained by executing motion-compensated interframe prediction, characterized
in that a motion-vector coding portion is provided with a side-information coding portion (235) for encoding
an additional motion vector as a difference from a mean of four basic motion-vectors.

13. A video coding device as defined in any one of claims 8 and 9, characterized in that the motion-vector coding
portion of the area-dividing pattern deciding portion is provided with a side-information coding portion (235) for
encoding an additional motion vector as a difference from a mean of four basic motion-vectors.



14. A video coding device for encoding predicted error information determined as a difference between an input
video-frame and a predicted video-frame obtained by executing motion-compensated interframe prediction, characterized
in that a motion-vector coding portion is provided with a side-information coding portion (235) for encoding
an additional motion vector positioned between two basic motion-vectors as a difference from a mean of the two
basic motion-vectors and encoding a center additional motion-vector as a difference from a mean of four basic
motion-vectors.



15. A video coding device as defined in any one of claims 8 and 9, characterized in that the motion-vector coding
portion of the area-dividing pattern deciding portion is provided with a side-information coding portion (235) for
encoding an additional motion vector positioned between two basic motion-vectors as a difference from a mean
of the two basic motion-vectors and encoding a center additional motion-vector as a difference from a mean of
four basic motion-vectors.



16. A video coding device for encoding predicted error information determined as a difference between an input
video-frame and a predicted video-frame obtained by motion-compensated interframe prediction, characterized in that
a motion-vector coding portion is provided with a side-information coding portion (235) for encoding an objective
motion-vector value by prediction from values of three motion-vectors which are already encoded basic or additional
motion-vectors existing at the left side, the just upper side and the upper right side of the objective motion-vector.

17. A video coding device as defined in claim 16, characterized in that the motion-vector coding portion determines a 
mean of the three motion-vectors as a predicted value of the objective motion-vector and encodes a difference 
between the objective motion-vector and the predicted value. 

18. A video coding device as defined in claim 16, characterized in that the motion-vector coding portion determines a
median of the three motion-vectors as a predicted value of the objective motion-vector and encodes a difference
between the objective motion-vector and the predicted value.



19. A video coding device as defined in any one of claims 8 and 9, characterized in that the area-dividing pattern
deciding portion (234) is provided with the motion-vector coding portion defined in any one of claims 16, 17 and 18.

20. A video coding device as defined in any one of claims 8, 9, 11, 13, 15 and 19, characterized in that the
motion-compensated interframe-predicting portion (217, 1217) is provided with an area-dividing pattern deciding portion
(234) for instructing the variable-area predicted-frame generating portion (233) to apply all kinds of dividing
patterns to an area, adopting the area-dividing pattern minimizing a prediction error (error information value), outputting
motion-vectors and area-dividing information to the side-information coding portion and outputting a predicted video-frame.



21. A video coding device as defined in any one of claims 8, 9, 11, 13, 15 and 19, characterized in that the
motion-compensated interframe-predicting portion (217, 1217) is provided with an area-dividing pattern deciding portion (234)
which instructs the variable-area predicted-frame generating portion (233) to divide an area into two small areas
as an initial setting and/or make prediction by translational displacement, instructs the variable-area
predicted-frame generating portion (233) to divide the area more finely if a prediction error (error information value) of
a predicted frame generated with the initial setting exceeds a preset threshold value, and which deciding portion
(234) outputs motion-vectors and area-dividing information to the side-information coding portion (235) when the
prediction error (error information value) becomes smaller than the preset threshold value, and also outputs a
predicted video-frame.



22. A video coding device as defined in any one of claims 8, 9, 11, 20 and 21, which is provided with a subtracting
portion (1211) to which the input video-frame and the predicted video-frame from the motion-compensated
interframe-predicting portion (217, 1217) are transferred, a video coding portion (1212) for encoding an error (differential)
video-frame received from the subtracting portion (1211) according to an instruction of a coding control portion
(1213), a video decoding portion (1214) for decoding a coded video-frame from the video coding portion (1212),
an adder portion (1215) for adding a predicted video-frame received from the motion-compensated
interframe-predicting portion (217, 1217) to a decoded video-frame received from the video decoding portion (1214) and a
[...] stored information to the motion-compensated interframe-predicting portion (217, 1217).

23. A video decoding device for reproducing a video-frame from coded side-information inputted from a video-coding
device and a reference video-frame inputted from a frame memory (224, 1224), characterized in that a
motion-compensated interframe-predicting portion (223, 1223) for producing and outputting a predicted video-frame with
a changed processable area-size is provided with a side-information decoding portion (238) for decoding coded
side-information and obtaining motion-vectors and area-dividing
information and a variable-area predicted video-frame generating portion (237) for generating a predicted
video-frame by using the motion-vectors and the area-dividing information received from the side-information decoding portion
(238) and a reference video-frame received from the frame memory.

24. A video decoding device for reproducing a video-frame from coded side-information inputted from a video-coding
device and a reference video-frame inputted from a frame memory (224, 1224), characterized in that a
side-information decoding portion (238) is provided for decoding four basic motion-vectors contained in the side-information
before decoding motion-vectors and for decoding additional motion-vectors representing a difference from a mean
of the basic vectors.



25. A video decoding device as defined in claim 23, characterized in that the side-information decoding portion (238)
is provided for decoding four basic motion-vectors contained in the side-information before decoding motion-vectors and for
decoding additional motion-vectors representing a difference from a mean of the basic vectors.

26. A video decoding device for reproducing a video-frame from coded side-information inputted from a video-coding
device and a reference video-frame inputted from a frame memory (224, 1224), characterized in that a
side-information decoding portion (238) is provided for decoding four basic motion-vectors contained in the side-information
before decoding motion-vectors and for decoding an additional motion-vector positioned between two basic
motion-vectors as a difference from an obtained mean of the two basic motion-vectors and decoding a center additional
motion-vector as a difference from an obtained mean of four basic motion-vectors.

27. A video decoding device as defined in claim 23, characterized in that a side-information decoding portion (238)
decodes four basic motion-vectors contained in the side-information before decoding motion-vectors, decodes
an additional motion-vector positioned between two basic motion-vectors as a difference from an obtained mean
of the two basic motion-vectors and decodes a center additional motion-vector as a difference from an obtained
mean of four basic motion-vectors.

28. A video decoding device for reproducing a video-frame from coded side-information inputted from a video-coding
device and a reference video-frame inputted from a frame memory (224, 1224), characterized in that a
side-information decoding portion (238) is provided for decoding an objective motion-vector value by prediction from values
of three motion-vectors which are already decoded basic or additional motion-vectors existing at the left side, the
just upper side and the upper right side of the objective motion-vector.

29. A video decoding device as defined in claim 28, characterized in that the side-information decoding portion (238)
determines a mean of the three motion-vectors as a predicted value of the objective motion-vector and obtains a
decoded objective motion-vector value by adding the decoded difference value to the predicted value.

30. A video decoding device as defined in claim 28, characterized in that the side-information decoding portion (238) 
determines a median of the three motion vectors as a predicted value of the objective motion-vector and obtains 
a decoded objective motion-vector value by adding the decoded difference value to the predicted value. 

31. A video decoding device as defined in claim 23, characterized in that the side-information decoding portion (238)
is provided with the motion-vector decoding system as defined in any one of claims 28, 29 and 30.

32. A video decoding device as defined in any one of claims 23, 25, 27 and 31, characterized in that it is further provided 
with a video decoding portion (1221) for decoding a coded video-information inputted from a video coding device 
and outputting the decoded video-information, an adder portion (1222) for adding a decoded video-frame inputted 
from the video decoding portion (1221) to a predicted video-frame inputted from the motion-compensated inter- 
frame predicting portion (233, 1223) and outputting an obtained video-frame as an output video-frame and a frame 
memory (224, 1224) for storing a video-frame from the adder portion (1222) and for outputting said stored video- 
frame as a reference video-frame to the motion-compensated interframe predicting portion (233, 1223). 
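
The adder and frame-memory loop of claim 32 can be illustrated with the following minimal sketch. The class name, the method name and the `predict` callback are hypothetical; frames are assumed to be lists of rows of integer samples, and no clipping to a sample range is applied.

```python
class DecoderLoop:
    """Sketch of the adder portion plus frame memory (names are illustrative)."""

    def __init__(self, initial_reference):
        # Frame memory: holds the reference video-frame for the next prediction.
        self.frame_memory = [row[:] for row in initial_reference]

    def reconstruct(self, decoded_frame, predict):
        """Add the decoded frame to the predicted frame and store the result.

        predict is a caller-supplied function standing in for the
        motion-compensated interframe predicting portion; it maps the
        reference frame to a predicted frame of the same size.
        """
        predicted = predict(self.frame_memory)
        output = [
            [p + d for p, d in zip(p_row, d_row)]
            for p_row, d_row in zip(predicted, decoded_frame)
        ]
        # The output frame becomes the reference for the next frame.
        self.frame_memory = [row[:] for row in output]
        return output


loop = DecoderLoop([[10, 20], [30, 40]])
# Identity prediction (zero motion) is used here purely for illustration.
print(loop.reconstruct([[1, -1], [0, 2]], lambda ref: ref))  # [[11, 19], [30, 42]]
```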

33. A video coding device using an adaptive motion-compensated interframe-prediction system, comprising: 

a predicting portion (332) for producing a plurality of predictive images of variable-size areas by applying different 
predicting methods to each of the variable-size areas in the process of the motion-compensated interframe prediction 
and outputting the produced predictive images; 

an area prediction deciding portion (333) for determining an adaptive area size and an adaptive prediction method 
according to a plurality of the predictive images received from the predicting portion (332) and outputting side- 
information including area-information, prediction-mode information, motion-vectors and so on; and 

a side-information coding portion (334) for encoding the side-information outputted from the area prediction deciding 
portion. 
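
One way the area prediction deciding portion could choose among the predictive images is sketched below. The decision criterion is an assumption: the claim does not name one, so the sum of absolute differences (SAD) against the input area is used here. The function names and mode labels are hypothetical.

```python
def sad(block_a, block_b):
    """Sum of absolute differences between two equally sized blocks."""
    return sum(
        abs(a - b)
        for row_a, row_b in zip(block_a, block_b)
        for a, b in zip(row_a, row_b)
    )


def decide_prediction(input_block, candidates):
    """Pick the prediction mode whose predictive image best matches the input.

    candidates maps a mode name to the predictive block produced by that
    method. Returns (mode, error). Minimum SAD is an assumed criterion.
    """
    best_mode = min(candidates, key=lambda mode: sad(input_block, candidates[mode]))
    return best_mode, sad(input_block, candidates[best_mode])


input_block = [[10, 10], [10, 10]]
candidates = {
    "block": [[9, 9], [9, 9]],        # SAD 4
    "affine": [[10, 11], [10, 10]],   # SAD 1
    "background": [[0, 0], [0, 0]],   # SAD 40
}
print(decide_prediction(input_block, candidates))  # ('affine', 1)
```

A practical decision would probably also weigh the amount of side-information each choice requires; that cost is left out of this sketch.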

34. A video coding device using an adaptive motion-compensated interframe-prediction system, comprising: 

a predicting portion (332) for producing a plurality of predictive images of respective unit areas or further- 
divided subareas by applying different predicting methods to each of the unit areas or each of the subareas 
in the process of the motion-compensated interframe prediction and then outputting the produced predictive 
images; 

an area prediction deciding portion (333) for determining an area-information indicating whether the predicted 
images received from the predicting portion (332) are unit-area images or subarea images and prediction 
methods applied to the respective predicted images and outputting side-information including area-informa- 
tion, prediction mode information, motion-vectors and so on; and 

a side-information coding portion (334) for encoding the side-information received from the area prediction 
deciding portion (333). 

35. A video coding device using an adaptive motion-compensated interframe-prediction system, comprising: 

a predicting portion (332) for producing a plurality of predictive images of respective variable-size areas by 
using a block-displacement (overlapped motion-compensative) predicting method, an affine transform pre- 
dicting method, a bilinear transform predicting method and a background predicting method, etc. in the process 
of the motion-compensated interframe prediction and then outputting the produced predictive images; 

an area prediction deciding portion (333) for determining an adaptive area-size and an adaptive prediction 
method according to a plurality of the predictive images received from the predicting portion (332) and out- 
putting side-information including area-information, prediction mode information and motion-vectors and so 
on; and 

a side-information coding portion (334) for encoding the side-information received from the area prediction 
deciding portion (333). 

36. A video coding device using an adaptive motion-compensated interframe-prediction system, which is provided 
with a predicting portion (332) for diagonally dividing each encodable unit-area into two or four subareas, producing 
a predictive image for each subarea by affine transformation in the process of the motion-compensated interframe 
prediction and outputting the produced predicted images. 
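
The diagonal division of a unit-area into two or four subareas can be sketched as a label map over a square block. The function name is hypothetical, the unit-area is assumed to be square, and the assignment of pixels lying exactly on a diagonal is a convention chosen for this illustration; the claim does not specify it.

```python
def diagonal_partition(size, parts=2):
    """Return a size x size map of subarea labels for a square unit-area.

    parts=2 splits along the top-left to bottom-right diagonal.
    parts=4 splits along both diagonals (0: top, 1: right, 2: left, 3: bottom).
    Pixels on a diagonal go to the upper or left side by convention
    (an assumption of this sketch).
    """
    if parts not in (2, 4):
        raise ValueError("parts must be 2 or 4")
    labels = []
    for y in range(size):
        row = []
        for x in range(size):
            below_main = y > x             # below the top-left/bottom-right diagonal
            below_anti = x + y > size - 1  # below the top-right/bottom-left diagonal
            if parts == 2:
                row.append(int(below_main))
            else:
                row.append(2 * int(below_main) + int(below_anti))
        labels.append(row)
    return labels


for row in diagonal_partition(4, parts=4):
    print(row)
# [0, 0, 0, 0]
# [2, 0, 0, 1]
# [2, 2, 1, 1]
# [2, 3, 3, 1]
```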

37. A video coding device using an adaptive motion-compensated interframe-prediction system, as defined in any one 
of claims 33, 34 and 35, characterized in that it is provided with a predicting portion (332) for diagonally dividing 
each encodable unit-area into two or four subareas, producing a predictive image for each subarea by applying 
different prediction methods in the process of the motion-compensated interframe prediction and outputting the 
produced predicted images. 

38. A video decoding device using an adaptive motion-compensated interframe-prediction system, comprising: 

a side-information decoding portion (335) for decoding coded side-information including area-information, pre- 
diction mode information and motion-vectors, and so on and outputting the decoded side-information; 

a predicting portion (336) for producing a plurality of predictive images of variable-size areas by applying 
different predicting methods to each of the variable-size areas in the process of the motion-compensated 
interframe prediction and outputting the produced predictive images; and 

an area-prediction-mode selecting portion (337) for producing an adaptive predictive image from a plurality of 
the predictive images received from the predicting portion (336) according to the information received from 
the side-information decoding portion (335) and outputting the produced predictive images. 
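
The area-prediction-mode selection on the decoder side can be sketched as composing one predictive image from several candidates according to the decoded side-information. For simplicity this sketch assumes fixed-size square areas, whereas the claim covers variable-size areas; the function name and the mode labels are hypothetical.

```python
def select_adaptive_prediction(candidates, area_modes, area_size):
    """Compose an adaptive predictive image from per-mode candidate frames.

    candidates maps a mode name to a full predicted frame (list of rows).
    area_modes is a 2-D list holding the decoded prediction mode of each
    area. Assumption: all areas are squares of side area_size.
    """
    height = len(area_modes) * area_size
    width = len(area_modes[0]) * area_size
    return [
        [
            candidates[area_modes[y // area_size][x // area_size]][y][x]
            for x in range(width)
        ]
        for y in range(height)
    ]


candidates = {
    "block": [[1] * 4 for _ in range(4)],
    "affine": [[2] * 4 for _ in range(4)],
}
area_modes = [["block", "affine"], ["affine", "block"]]
for row in select_adaptive_prediction(candidates, area_modes, 2):
    print(row)
# [1, 1, 2, 2]
# [1, 1, 2, 2]
# [2, 2, 1, 1]
# [2, 2, 1, 1]
```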

39. A video decoding device using an adaptive motion-compensated interframe-prediction system, comprising: 

a side-information decoding portion (335) for decoding coded side-information including area-information, pre- 
diction mode information and motion-vectors, and so on and outputting the decoded side-information; 

a predicting portion (336) for producing a plurality of predictive images of respective decodable unit areas or 
further-divided subareas by applying different predicting methods to each of the unit areas or each of the 
subareas in the process of the motion-compensated interframe prediction and then outputting the produced 
predictive images; and 

an area-prediction-mode selecting portion (337) for producing an adaptive predictive image from a plurality of 
the predictive images received from the predicting portion (336) according to the information received from 
the side-information decoding portion (335) and then outputting the produced predictive images. 

40. A video decoding device using an adaptive motion-compensated interframe-prediction system, comprising: 

a side-information decoding portion (335) for decoding coded side-information including area-information, pre- 
diction mode information and motion-vectors, and so on and outputting the decoded side-information; 

a predicting portion (336) for producing a plurality of predictive images of respective decodable unit areas or 
further-divided subareas by using a block-displacement (overlapped motion-compensative) predicting method, an 
affine transform predicting method, a bilinear transform predicting method and a background predicting method, 
etc. in the process of the motion-compensated interframe prediction and then outputting the produced 
predictive images; and 

an area-prediction-mode selecting portion (337) for producing an adaptive predictive image from a plurality of 
the predictive images received from the predicting portion (336) according to the information received from 
the side-information decoding portion (335) and outputting the produced predicted images. 

41. A video decoding device using an adaptive motion-compensated interframe prediction system, which is provided 
with a predicting portion (336) for diagonally dividing each decodable unit-area into two or four subareas, producing 
a predictive image for each subarea by affine transformation in the process of the motion-compensated interframe 
prediction and outputting the produced predicted images. 
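
Affine prediction over one triangular subarea can be illustrated by deriving a motion vector for every pixel from the motion vectors at the three triangle vertices. Placing the control vectors at the vertices is an assumption of this sketch, and the function name is hypothetical. Barycentric weighting of three vertex vectors yields exactly an affine motion field.

```python
def affine_vector(point, vertices, vectors):
    """Motion vector at a point inside a triangle with given vertex vectors.

    vertices: three (x, y) corner positions of the triangular subarea.
    vectors: the (dx, dy) motion vectors assumed to be given at those corners.
    The result is the barycentric combination of the three vectors, which
    is an affine function of the point coordinates.
    """
    (x0, y0), (x1, y1), (x2, y2) = vertices
    det = (x1 - x0) * (y2 - y0) - (x2 - x0) * (y1 - y0)
    if det == 0:
        raise ValueError("degenerate triangle")
    px, py = point
    w1 = ((px - x0) * (y2 - y0) - (x2 - x0) * (py - y0)) / det
    w2 = ((x1 - x0) * (py - y0) - (px - x0) * (y1 - y0)) / det
    w0 = 1 - w1 - w2
    return (
        w0 * vectors[0][0] + w1 * vectors[1][0] + w2 * vectors[2][0],
        w0 * vectors[0][1] + w1 * vectors[1][1] + w2 * vectors[2][1],
    )


vertices = [(0, 0), (8, 0), (0, 8)]
vectors = [(0, 0), (8, 0), (0, 4)]
print(affine_vector((4, 4), vertices, vectors))  # (4.0, 2.0)
print(affine_vector((2, 2), vertices, vectors))  # (2.0, 1.0)
```

A predicted pixel would then be fetched from the reference frame at the position displaced by this vector, typically with sub-pixel interpolation, which is omitted here.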

42. A video decoding device using an adaptive motion-compensated interframe prediction system, as defined in any 
one of claims 38, 39 and 40, characterized in that it is provided with a predicting portion (336) for diagonally dividing 
each decodable unit-area into two or four subareas, producing a predictive image for each subarea by applying 
different prediction methods in the process of the motion-compensated interframe prediction and then outputting 
the produced predicted images. 
FIG. 13 